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Recombinational Cloning Using 
Engineered Recombination Sites 



PCT/US96/10082 



Background of the Invention 
Field of the Invention 

The present invention relates to recombinant DNA technology. DNAand 
vectors having engineered recombination sites are provided for use in a 
recombinational cloning method that enables eflScient and specific recombination 
of DNA segments using recombination proteins. The DNAs, vectors and 
methods are usefiil for a variety of DNA exchanges, such as subcloning of DNA, 
in vitro or in vivo. 

Related Art 

Site specific recombinases. Site specific recombinases are en2ymes that 
are present in sorrie viruses and bacteria and have been characterized to have both 
endonuclease and Ugase properties. These recombinases (along with associated 
proteins in some cases) recognize s[>ecific sequences of bases in DNA and 
exchange the DNA segments flanking those segments. The recombinases and 
associated proteins are collectively referred to as "recombination proteins" {see, 
e.^.„ Landy, A.y Current ppinion in Biotechnology 3 .6^*701 (1993)). 

Numerous recombination systems firom various organisnis have been 
described. See, e.g., Hoess et aL, Nucleic Acids Research 14i€):22Z7 (1986); 
Abremski et al., J. Biol Chem.261 il):391 (1986); Campbell, J. 
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BacterioL174{23):7495 (1992); Qian era/., ^ BioL Chenu 267(11):7794 (1992); 
AraM et al.,J. Mol Biol 22J(1):25 (1992); Maeser and Kahnmann (1991) Mol 
Gen Genet 250:170-176). 

Many of these belong to the integiase family of recombinases 
(Argos etaL EMBOJ. 5:433-440 (1986)). Perfi^s tiie best studied of these are 
the Integrase/o/^ system fix>m bacteriophage k (Landy, A. Current Opinions in 
Genetics ondDevel 5:699-707 (1993)), the Cie/loxP system from bacteriophage 
PI (Hoess and Abremski (1990) In Nucleic Adds cpuI Molecular Biology^ vol. 4. 
Eds.: Eckstein and Lilley, Berlin-Heidelberg: Springer-Verlag; pp. 90-1 09) , and 
the FLP/FRT system from the Saccharomyces cerevisiae 2 \i circle plasmid 
(Broach et al Cell 2P:227-234 (1982)). 

While these recombixmtion systems have been characterized for particular 
organisms, the related art has only taught using recombinant DNA flanked by 
recombination sites, for in vivo recombination, 

Backman (U.S, Patent No. 4,673,640) discloses the in vivo use of X 
recombinase to recombine a protdn producing DNA segment by enzymatic site- 
specific recombination using wild-type recombination sites attB and attP. 

Hasan and Szybalski (Gene Jtf:145-151 (1987)) discloses the use of X Int 
recombinase in vivo for intramolecular recombination between wild type attP 
and attB sites \^ch flank a promoter. Because the orientations of these sites are 
inverted relative to each other, this causes an irreversible flipping of ^e promoter 
region relative to the gene of interest 

Palazzolo et al Gene 55:25-36 (1990), discloses phage lambda vectors 
having bacteriophage X aims that contain restriction sites positioned outside a 
cloned DNA seqiiaice and between wild-type IcocP sites. Iiffection of K coli cells 
that express the Cre recombinase with these phage vectors results in 
recombination between tiie loxP sites and tihe in vivo excision of &e plasmid 
repUcon, including the cloned cDNA. 

P6sfei et al. (Nucl Acids Res. 22:2392-2398 (1994)) discloses a method 
for inserting into genoniic DNA partial e>qpTession vectors having a selectable 
maxkor, flanked by two wfld-type FRT recognition sequences. FLB site-specific 
recombinase as preset in the cells is used to integrate tiie vectors into the 
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genome at predetermined sites. Under conditions where the replicon is 
functional, this cloned genomic DNA can be amplified. 

Bebee et al (U.S. Patent No. 5,434,066) discloses the use of site-specific 
recombinases such as Cre for DNA containing two loxP sites is used for in vivo 
recombination between the sites. 

Boyd {Nucl Acids Res. 27:817-821 (1993)) discloses a method to 
facilitate the cloning of blunt-ended DNA using conditions that encoiirage 
intermolecular ligation to a dephosphorylated vector that contains a wild-type 
loxP site acted upon by a Cre site-specific recombinase present in E. coli host 
cells. 

Wateihousec/a/. (PCTNo. 93/19172 andM/c/e/c^c/tfe/?e5. 21 (9):2265 
(1 993)) disclose an in vivo method i^ere light and heavy chains of a particular 
antibody were cloned in different phage vectors between loxP and loxP 511 sites 
and used to transfect new E, coli cells. Cre, acting in the host cells on the two 
parental molecules (one plasmid, one phage), produced four products in 
equilibrium: two diflFerent cointegrates (produced by recombination at either loxP 
or loxP 511 sites), and two daughter molecules, one of which was the desired 
product 

In contrast to flie other related art, Schlake & Bode {Biochemistry 
55:12746-12751 (1994)) discloses an in vivo method to^xchange expression 
cassettes at defined chromosomal locations, each flanked by a wild type and a 
spacer-mutated FRT recombination site, A double-reciprocal crossover was 
mediated in cultured mammalian cells by using this FLP/FRT system for site- 
specific recombination. 

Transposases. The family of enzymes, the transp&sases, has also been 
used to transfCT genetic information between replicons. Tranq>osons are 
structuraUy variable, being described as simple or compound, but typically 
encode the recombinase gene flanked by DNA sequences organized in inverted 
orientations. Integration of transposons can be random or highly specific. 
Representatives such as Tn7, \^ch are higjily ate-spedific, have been ^plied to 
the in vivo movement of DNA segments between replicons (Lucklow et cd,y 
J. Virol 57:4566-4579 (1993)). 
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Devine and Boeke Nucl Acids Res, 22:3765-3772 (1994), discloses the 
construction of artificial transposons for the insertion of DNA segments, in vitro ^ 
into recipient DNA molecules. The system makes use of tfie integrase of yeast 
TYl virus-like particles. The DNA segment of interest is cloned, usmg standard 
methods, between the ends of the transposon-like element TYl . In the presence 
of the TYl integrase, the resulting element integrates randomly into a second 
target DNA molectile, 

DNA cloning. The cloning of DNA segments currently occurs as a daily 
routine in many research labs and as a prerequisite step in many genetic analyses. 
The purpose of these clonings is various, howevar, two general purposes can be 
considered: (1) the initial cloning of DNA fix)m large DNA or RNA segments 
(chromosomes, YACs, PGR fragments, mKNA, etcO, done in a relative handfW 
of known vectors such as pUC, pGem, pBIueScript, and (2) flie subcloning of 
these DNA segments into specialized vectors for functional analysis. A great 
deal of time and eflTort is expended both in the initial cloning of DNA segments 
and in the transfer of DNA segments from the initial cloning vectors to the more 
specialized vectors. This transfer is called subcloning. 

The basic methods for cloning have been known for many years and have 
changed litde during that time. A typical cloning protocol is as follows: 

(1) digest the DNA of interest with pne or two restriction 
enzymes; 

(2) gel purify the DNA segment of interest vibsn known; 

(3) prepare tibe vector by cutting with appropriate restriction 
enzymes, treating with alkaline phosphatase, gel purify ^c, as 
^propriate; ; 

(4) ligate the DNA segment to vector, with appropriate controls 
to estimate background of uncut and self-ligated vector; 

(5) intrpduce the resulting vector into an £ coli host cell; 

(6) pick selected colonies and grow small cultures overnight; 

(7) inake DNA minipreps; and 

(8) analyze ^e isolated plasmid on agarose gels (often after 
diagnostic restriction enzyme digestions) or by PGR. 
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The specialized vectors used for subcloning DNA segments are 
functionally diverse. These include but are not limited to: vectors for expressing 
genes in various organisms; for regulating gene expression; for providing tags to 
aid in protein purification or to allow tracking of proteins in cells; for modifying 
the cloned DNA segment (e.g.^ generating deletions); for the synthesis of probes 
(e.g., riboprobes); for the pr^aration of templates for DNA sequencing; for the 
identification of protein coding regions; for the fusion of various protein-coding 
regions; to provide large amounts of the DNA of interest, etc. It is common that 
a particular investigation will involve subcloning the DNA segment of interest 
into several different specialized vectors. 

As known in the art, simple subclonings can be done in one day (e.g., the 
DNA segment is not large and the restriction sites are compatible with those of 
the subcloning vector). However, many other subclonings can take several 
weeks, especially those involving unknown sequences, long fi:agments, toxic 
genes, unsuitable placement of restriction sites, high backgrounds, impure 
euTymes, etc, Subcloning DNA fi:agments is thus often viewed as a chore to be 
done as few times as possible. 

Several methods for facilitating the cloning of DNA segments have been 
described, e.g., as in the following references. 

Ferguson, J., et al Gene 75:191 (1981), discloses » family of vectors for 
subcloning fi:agments of yeast DNA, The vectors encode kanamycin resistance. 
Clones of longo: yeast DNA segmesits can be partially digested and ligated into 
the subcloning vectors. If the original cloning vector conveys resistance to 
ampicillin, no piirificafion is necessary prior to transformation, since the select 
will be for kanamycin. - 

Hashimoto-Gotoh, T-, et ah Gene 41:125 (1986), discloses a subcloning 
vector with imique- cloning sites, within a streptomycin sensitivity gene; in a 
streptomycin-resistant host, only plasmids with inserts or deletions in the 
dominant sensitivity gene will survive streptomycin selection. 

Accordingly, traditional subcloning methods, using restriction enzymes 
and ligase, are time consuming and relatively unreliable. Considerable labor is 
expended, and if two or more days later the desired subclone can not be found 
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among the candidate plasmids, the entire process must then be repeated with 
alternative conditions attempted. Although site specific recombinases have been 
used to recombine DNA in v/vo, the successful use of such enzymes in vitro was 
expected to suflFer from several problems. For example, the site specificities and 
efficiencies were e?q>ected to dififer m vitro; topologically-linked products were 
expected; and the topology of the DNA substrates and recombination proteins 
was expected to dififer significantly in vitro (see, e.g,^ Adams et al, J, Mol 
Biol 226:661-73 (1992)). Reactions that could go on for many hours in vivo 
were expected to occur in significantly less time in vitro before the enzymes 
became inactive. Multiple DNA recombination products were expected in the 
biological host used, resulting in unsatis&ctory reliability, specificity or 
efficiency of subcloning. In vitro recombination reactions were not expected to 
be sufficiently efficient to yield the desired levels of product 

Accordingly, there is a long felt need to provide an alternative subcloning 
system that provides advantages over the known use of restriction en^ntnes and 
ligases. 

Summary of the Invention 

The preset invention provides nucleic acid, ^*ctors and methods for 
obtaining chimeric nucleic acid using recombination proteins and engineered 
recombination sites, fn vf/^-o or m vrvo. These methods arc highly ispecific, rapid, 
and less labor intensive than what is disclosed or suggested in the related 
background art The improved specificity, speed and yields of the present 
inv^tion fedlitates DNA or RNA subcloning, regulation 6r exchange usefiil for 
any related purpose. Such purposes include in vitro recombination of DNA 
segments and in vitro or in vfva insertion or modification of transcribed, 
replicated, isolated or genoxhic DNA or RNA. 

The present invraticm relates to nucleic acids, vectors and meffiods for 
moving or exchanging segnlents of DNA using at least one engineered 
recombination site and at least one recombinatidn protein to provide chimeric 
DNA molecules vMch have the desired cfaaractaistic(s) and/or DNA seg^^€nt(s). 
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Generally, one or more parent DNA molecules are recombined to give one or 
more daughter molecules, at least one of which is the desired Product DNA 
segment or vector. The invention thus relates to DNA, RNA, vectors and 
methods to effect the exchange and/or to select for one or more desired products. 

One embodiment of the present invention relates to a method of making 
chimeric DNA, which comprises 

(a) combining fn vi/ro or i« vrvo 

(i) an Insert Donor DNA molecule, comprising a desired DNA 
segment flanked by a first recombination site and a second recombination site, 
wherein the first and second recombination sites do not recombine with each 
other; 

(ii) a Vector Donor DNA molecule containing a third 
recombination site and a fourth recombination site, wherein the third and fourth 
recombination sites do not recombine with each other; and 

(iii) one or more site specific recombination proteins capable 
of recombming the first and third recombinational sites and/or the second and 
fourth recombinational sites; 

thereby allowing recombinatioii to occur, so as to produce at least one 
Cointegrate DNA molecule, at least one desired Product DNA molecule which 
comprises said desired DNA segment, and optionally a Byproduct DNA 
molecule; and then, optionally, 

(b) selecting for the Product or Byproduct DNA molecule. 

Another embodiment of flie present invention relates to a kit comprising 
a carrier or receptacle being compartmentalized to receive and hold therein at 
least^ one container, wherein a first container contains a DNA molecule 
comprising a vector having at least two recombination sites flanking a cloning 
site or a Selectable inarker, as described ha?ein. The kit optionally fiirther 
comprises: 

(i) a second container containing a Vector Donor plasmid 
comprising a subcloiiing vector and/or a Selectable marker of ^ch one 
or both are flanked by one or more engineered recombination sites; and/or 
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(ii) a tbird container containing at least one recombination protein 
Avhich recognizes and is capable of recombining at least one of said 
recombination sites. 

Other embodimoits include DNA and vectors usefiil in the me&ods of the 
present invention. In particular. Vector Donor molecules are provided in one 
embodiment, \^erein DNA segments widiin the Vector Donor are separated 
either by, 0) iii a circular Vector Donor, at least two recombination sites, or (ii) 
in a linear Vector Donor, at least one recombination site, \^ilere the recombination 
sites are preferably engineered to enhance specificity or efficiency of 
recombination. 

One Vector Donor embodiment comprises a first DNA segment and a 
second DNA segment, the first or second segment comprising a Selectable 
marker. A second Vector Donor embodiment comprises a first DNA segment 
and a second DNA segment, the first or second DNA segment comprising a toxic 
gene. A third Vector Donor embodiment comprises a first DNA segment and a 
second DNA segment, the first or second DNA segment comprising an inactive 
fragment of at least one Selectable marker, wherein the inactive fi:agment of the 
Selectable marker is enable of reconstituting a functional Selectable marker 
when recombined across the first or second recombination site with another 
inactive fiagmentofat least one Selectable marker. * 

The presCTt recombinational cloning method possesses several advantages 
over previous in vfvp methods. Since sin^e molecules of recombination products 
can be introduced into a biological host, propagation of the des 
in the abs^ce of other DNA molecules {e.g^ y starting molecules, intermediates, 
by-products) is more readily realizedL Reaction conditions can be fieely 
adjusted in vitro to optimize mzyme activities. DNA mbleculies can be 
inconq>atible \vi&&e desired biolo^<^ ho^ 

can be used. Recombination proteins fiom diverse sources can be employed, 
together or soquentially* 

Other embodiments will be evident to those of ordinary skill in the art 
fiqm the trarliiuBS contained herein in combinatioii wifli vAvst is known to tfie art 
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Brief Description of the Figures 



Figure 1 depicts one general method of the present invention, wherein the 
starting (parent) DNA molecules can be circular or linear. The goal is to 
exchange the new subcloning vector D for the original cloning vector B. It is 
desirable in one embodiment to select for AD and against all the other molecules, 
including tiie Cointegrate. The square and circle are sites of recombination: e.^., 
loxP sites, att sites, etc. For example, segment D can contain expression signals, 
new drug markers, new origins of replication, or specialized functions for 
moping or sequencing DNA. 

Figure 2A depicts an in vitro method of recombining an Insert Donor 
plasmid (here, pEZC705) with a Vector Donor plasmid (here, pEZC726), and 
obtaining Product DNA and Byproduct daughter molecules. The two 
recombination sites are attP and loxP on the Vector Donor. On one segment 
defined by these sites is a kanamycin resistance gene whose promoter has been 
replaced by the tetOP operator/promoter fi-om transposon TwlO. See Sizemore 
et al, Nucl Acids Res. J5(10):2875 (1990). In the absence of tet repressor 
protein, K coli RNA polymerase transcribes flie kahainycin resistance gene from 
the tetOP. If tet repressor is present, it binds to tetOP and blocks transcription of 
the kanamycm resistance gene. The other segmi^ht of PEZC726 has the tet 
repressor gene expressed by a constitutive promoter. Thus cells transformed by 
pEZC726 are resistant to chloramphenicol, because of tlie chloramphenicol acetyl 
transferase gene on the same segment as tetRj but are sensitive to kanamycin. 
The recombinase-mediated reactions result in separation of the tetR gene fix)m the 
regulated kan^ycin resistance gene. This separation results in kanamycin 
resistance in cells receiving only the desired recombination products. The first 
recombination reactioii is driven by the addition of the recombinase called 
Integrase. The second recombination reaction is driven by adding the 
recombuiase Ore to the Coinfegrate (here, pEZC7 Cointegrate). 
Figure 2B depicts a restriction map of pEZC705. 
Figure 2C depicts a restriction m^ 6iFpEZC726. 
Figure 2D depicts a restriction map of pEZC7 Cointegrate. 
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Figure 2E depicts a restriction map of InqjrocL 
Figure 2F depicts a restriction map of Intbypro. 

Figure 3A depicts an m vitro method of recombining an Insert Donor 
plasmid (here, pEZC602) with a Vector Donor plasmid (here, pEZC629), and 
obtaining Product (here, EZC6prod) and Byproduct (here, EZC6Bypr) daughter 
molecules. The two recombination sites are /oxP and /oxP 57i. One segment of 
pEZC629 defined by these sites is a kanamycin resistance gene whose promoter 
has been replaced by the tetOP op^tor^romoter firom transposon Tnl 0. hi the 
absence of tet repressor protein, R coli RNA i>olymerase transcribes the 
kanamycin resistance gene fix>m the tetOP. If tet repressor is present, it binds to 
tetOP and blocks transcription of the kanamycin resistance gene. The oihsx 
segmrat of pEZC629 has the tet repressor gene cjqjressed by a constitutive 
promoter. Thus cells transformed by pEZC629 are resistant to chloramphenicol, 
because of the chloramphenicol acetyl transferase gene on the same segment as 
/e//?, but are sensitive to kanamycin. The reactions result in separation of tiie/e/i{ 
gene firom the regulated kanamycin resistance gene. This separation results in 
kanamycin resistance in cells receiving the desired recombination product The 
first and the second recombination events are driven by tiie addition of the same 
recombinase, Cre. 

Figure 3B depicts a restriction m^ of EZC6B3(pr. 

Figure 3C depicts a restriction map of EZC6prod. 

Figure 3D depicts a restriction map of pEZC602. 

Figure 3E depicts a restriction map of pEZC629. 

Figure 3F depicts a restriction map of EZC6coint. 

F^ure 4A depicts an ^plication of the i/i vitro mediod of 
recoinbinadonal cloning to subclone the chloranq>henicol acetyl tran^eiase gene 
into a vector for e7q>re5sion in eukaryotic cdls. Tte Insert Donor plasmid, 
pEZC843, is conqxrised of the chloramphenicol acetyl transferase gCTe of £1 co/i, 
cloned between loxP znd attB sites such that Ae loocP site is positioned at the 
5'-end of the gene. The Vector Donor plasmid, pEZC1003, contains the 
cytomegalovirus eukaiyotic promoter apposed to a loxP site. The siq)ercoiled 
plasmids were combined with lambda Integrase and Cre recombinase in vitro. 
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After incubation, competent E, coU cells were transfonned with the 
recombinational reaction solution. Aliquots of transformations were spread on 
agar plates containing kanamycin to select for the Product molecule (here 
CMVProd). 

Figure 4B depicts a restriction map of pEZC843- 

Figure 4C depicts a restriction map of pEZClOOS. 

Figure 4D depicts a restriction map of CMVBypro. 

Figure 4E depicts a restriction map of CMVProd. 

Figure 4F depicts a restriction map of CMVcoint 

Figure 5A dq>icts a vector diagram of pEZC 1301. 

Figure SB depicts a vector diagram of pEZC1305. 

Figure 5C depicts a vector diagram of pEZC 1 309. 

Figure 5D depicts a vector diagram of pEZG1313. 

Figure 5E depicts a vector diagram of pEZC1317. 

Figure 5F depicts a vector diagram of pEZC 1 32 1 . 

Figure 5G depicts a vector diagram of pEZC1405. 

Figure 5H depicts a vector diagram of pEZC1502. 

Figure 6A depicts a vector diagram of pEZG 1 603. 

Figure 6B depicts a vector diagram of pEZC1706. 

Figure 7A depicts a vector diagram of pEZC290U 
Figure 7B depicts a vector diagram of pEZC2913 
Figure 7C depicts a vector diagram of pEZC3 101. 
Figure 7D depicts a vector diagram of pEZCl 802. 
Figure 8A depicts a vector diagram of pGEX-2TK:. 
Figure 8B depicts a vector diagram of pEZC3501 ; 
Figure 8C depicts a vector diagram of pEZC360L 
Figurje 8D depicts a vector diagram of pEZC3609. 
Figure 8E depicts a vector diagram of pEZC3617. 
Figure 8F depicts ai vector diagram of pEZC3606. 
Figure 8G depicts a vector diagram of pEZC36 13. 
Figure 8H depicts a vector diagram of pEZC3621 . 
Figure 81 depicts a vector diagram of GST-CAT. 
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Figure 8J depicts a vector diagram of GST-phoA. 
Figure 8K depicts a vector diagram of pEZC3201 . 

Detailed Description of the Preferred Embodiments 



It is unexpectedly discovered in the present invention that subcloning 
reacticms can be provided using recombinational cloning. Recombination cloning 
accoitling to the present invention uses DNAs, vectors and methods, in vitro and 
in vivo, for moving or exchanging segments of DNA molecules using engineered 
recombination sites and recombination proteins. These methods provide chimeric 
DNA molecules that have the desired characteristic(s) and/or DNA segment(s). 

The present invention flius provides nucleic acid, vectors and methods for 
obtaining chim^c nucleic acid using recombination proteins and engineered 
recombination sites, in vitro or in vivo. These methods are highly specific, rapid, 
and less labor intensive than what is disclosed or suggested in the related 
background art. The improved specificity, speed and yields of the present 
invention fecilitates DNA or RNA subcloning, regulation or exchange usefiil for 
any related purpose. Such purposes include in vitro recombination of DNA 
segments and in vitro or in vivo ins^on or modification of transcribed, 
replicated, isolated or genomic DNA or RNA. * 

Definitions 

In the descrq>tion fhat follows, a nunober of terms used in recombinant 
DNA technology iare utilized «ctenavely. In ord^ to provide a clear and 
consistent understanding of ^e specification and claims, including the scope to 
be given sudi terms, jflie foBoWing defimtions are provided. 

Byproduct is a daiigliter molecule (a new clone produced after the 
second recombination event during the recombmationa[ cloning process) lacking 
the DNA \^ch is desired to be subcloned. 
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Cointegrate: is at least one recombination intermediate DNA molecule 
of the present invention that contains both parental (starting) DNA molecules. 
It will usually be circular. In some embodiments it can be linear. 

Host: is any prokaryotic or eukaryotic organism that can be a recipient 
of the recombinational cloning Product A "host," as the term is used herein, 
includes prokaryotic or eukaryotic organisms that can be genetically engineered. 
For examples of such hosts, see Maniatis et al. Molecular Cloning: A Laboratory 
Manual Cold Spring Harbor Laboratory, Cold Spruig Harbor, New York (1982). 

Insert: is the desired DNA segment (segments of Figure 1) \\Wch one 
wishes to manipulate by the method of the present invention. The insert can have 
one or more genes. 

/ Insert Donor: is one of the two parental DNA molecules of the present 

uivention which carries the Insert The Insert Donor DNA molecule comprises 
the Insert flanked on both sides with recombination signals. The Insert Donor 
can be linear or circular. In one embodiment of the invention, the Insert Donor 
is a circular DNA molecule and further comprises a cloning vector sequence 
outside of the recombination signals (see Figure 1 ). . 

Product: is one or both the desired daughter molecules comprising the A 
and D otB and C sequences ^iuch are produced after the second recombination 
event during the recombinational cloning process (see Figure 1). The Product 
contains the DNA A^ch was to be cloned or subcloned. 

Promoter: is a DNA sequence generally described as the 5 -region of a 
gene, located proximal to the start codon* The transcription of an adjacent DNA 
segment is initiated at the promoter region. A repressible promoter's rate of 
transcription decreases in response to a repressing agent Ah inducible promoter's 
rate of transcription increases in response to an inducing agent A constitutive 
promoter's rate of transcription is not specifically regulated, though it can vary 
under the influence of general metabolic conditions. 
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Recognition sequence: Recognition sequences are particular DNA 
sequences which a protein, DNA, or RNA molecule (e.g., restriction 
endonuclease, a modification metiiylase, or a recombinase) recognizes and binds. 
For exan^le, the recognition sequmce for Cre recombinase is loxP vAAdx is a 34 
base pair sequence comprised of two 1 3 base pair inverted repeats (serving as the 
recombinase binding sites) flanking an 8 base pair core sequmce. See Figure 1 
of Sauer, B., Current Opinion in Biotechnology 5:521-527 (1994). Other 
examples of recognition sequences are the ottBy OttP^ attL, and ottR sequences 
which are recognized by the recombinase enzyme X Integrase. ottB is an 
apprpximately 25 base pair sequence containing two 9 base pair core-type Int 
binding sites and a 7 base pair overly region. attP is an approximately 240 base 
pair sequence containing core-type Int binding sites and arm-tyi>e Int binding 
sites as well as sites for auxiliary proteins IHF^FIS, and Xis. See Landyy Current 
^Opinion in Biotechnology 5:699-707 (1993). Sudi sites are also engineered 
according to the present invention to enhance methods and products. 

Recombinase: is an en^mie vdiich catalyzes the exchange of DNA 
segments at specific recombination sites. 

Recombinational Cloning: is a method described hi^in, \^ereby 
segmCTts of DNA molecules are exchanged, inserted, replaced, substituted or 
modified, in vitro or in vivo. 0 

Recombination proteins: include exdsive or integrative proteins, 
, enzymes, co-factors or associated proteins that are involved in recombination 
reactions involving one or more reccmibination sites. iStee^ Landy (1994), //j/f-a. 

. Repression cassette: is a DNA segment that contains a repressor of a 
Selectable niarker present in the subcloning vert ' 

Selectable marken is a DNA segm^ that allows one to select for or 
against a molecule or a cell that contains it» often under particular conditions. 
These markers can encode an activity, such as^ but not limited to, production of 
RNA, peptide, or protein, or can provide a binding site for RNA, peptides^ 
proteins, inorganic and organic compounds or compositions and the like. 
Examples of Selectable markers include but are not limited to: (1) DNA 
segmrats that encode products ^iuch provide resistance against otherwise toxic 
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compounds {e.g,^ antibiotics); (2) DNA segments that encode products v^ilich are 
otherwise lacking in the recipient cell (e.^., tRNA genes, auxotrophic markers); 
(3) DNA segments that encode products which suppress the activity of a gene 
product; (4) DNA segments that encode products which can be readily identified 
(e.g:, phenotypic markers such as p-galactosidase, green fluorescent protein 
(GFP), and cell surface proteins); (5) DNA segments that bind products vAAch. are 
otherwis&detrin)ental to cell survival and/or function; (6) DNA segments that 
otherwise inhibit the activity of any of the DNA segments described in Nos. 1-5 
above (e.g:, antisense oligonucleotides); (7) DNA segments that bind products 
that modify a substrate (e.g, restriction endonucleases); (8) DNA segments that 
can be used to isolate a desired molecule {e g. specific protein binding sites); (9) 
DNA segmmts that encode a specific nucleotide sequence wiiich can be 
otherwise non-functional (e.g., for PGR amplification of subpopulations of 
molecules); and/br (10) DNA segments, which when absent, directly or indirectly 
confer sensitivity to particular compounds. 

Selection scheme: is aiiy method ^^ch allows selection, enrichment, or 
identification of a desired Product or Product(s) firom a mixture containing the 
Insert Donor, Vector Donor, and/or any intermediates, (e.g. a Cointegrate) 
Byproducts. The selection schemes of one preferred embodiment have at least 
two components that are either linked or unlinked dicring recombinational 
cloning. One component is a Selectable marker. The other component controls 
the expression in vitro or in vivo of the Selectable marker, or survival of the cell 
harboring the plasmid carrying the Selectable maiicer. Generally, this controlling 
element will be a repressor or inducer of the Selectable marker, but other means 
for controlling expression of the Selectable madcer can 'be used. Whether a 
rq>ressor or activator is used wiU depend on \^ether the m^ 
or negative selection, and the exact arrangement of the various DNA segments, 
as will be readily {qyparent to tiiose skilled in the art A preferred requirement is 
that tiie selection scheme results in selection of or enrichment for only one or 
more desired Products. As defined herein, to select for a DNA molecule incltides 
(a) selecting or enriching for the presence of the desired DNA molecule, and (b) 
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selecting or enriching against the presence of DNA molecules that are not the 
desired DNA molecule. 

In one embodiment, the selection schemes (ii^ch can be carried out 
reversed) will take one of three forms, Mdiich will be discussed in temis of 
Figure 1 . The first, exemplified herein with a Selectable marker and a repressor 
therefor, selects for molecules having segment 2> and laddng segment C The 
second selects against molecules having segment C and for molecules having 
segment Possible embodiments of the second form would have a DNA 
segment carrying a gene toxic to cells into whidi the in vitro reaction products 
are to be introduced. A toxic gene can be a DNA that is e^qxressed as a toxic gene 
product (a toxic protein or RNA), or can be toxic in and of itself. (In the latter 
case, the toxic gene is understood to carry its classical definition of "heritable 
trait".) 

Examples of such toxic gene products are well known in the art, and 
include, but are not limited to, restriction endonucleases (eg:, DpnJ) and genes 
that kill hosts in the absence of a siq)pressing fimctioii,e.g:,A/c5. A toxic gene 
can alternatively be selectable in vitro, e.g. , a restriction site. 

In the second forni, segment 1> carries a Selectable marker. The toxic 
gene would eliminate transformants harboring the Vector Donor, Cointegrate, and 
Byproduct molecules, vAnlo the Selectable marker can b# used to select for cells 
containing the Product and' agaiiist cells harboriiig oiily tfie Iiisert Donor. 

The thinl fonn selects for ceUs tfiat have both segmCTtsyl and Z> in m on 
the same molecule, but not for cells that have both segmrats in trans on different 
molecules. This could be embodied by a Selectable marker that is split into two 
inactive firagmmts, one each on segments A and D, ' 

The fragments are so arranged relative to the recombination sites that 
Tf^^iOT the segments are brought togetfm by the recombination event, they 
reconstitute a functional Selectable marker. For example, the recombinational 
event can link a promoter with a structural gene, can link two fragments of a 
structural gene, or can link genes that encode a hieterodimCTc gene product 
needed for survival, or can lirik portioiis of a replicQTL 
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Site-sjwcific recombinase: is a type of recombinase which typically has 
at least the following four activities: (1) recognition of one or two specific DNA 
sequences; (2) cleavage of said DNA sequence or sequences; (3) DNA 
topoisomerase activity involved in strand exchange; and (4) DNA ligase activity 
to reseal the cleaved strands of DNA. See Sauer, B., Current Opinions in 
Biotechnology 5:521-527 (1994). Conservative site-specific recombination is 
distinguished fix>m homologous recombination and transposition by a high degree 
of specificity for both partners. The strand exchange mechanism involves the 
cleavage and rejoining of specific DNA sequences in the absence of DNA 
synthesis (Landy, A, (1989) Ann. Rev. Biochent 55:913-949). 

Subcloning vector: is a cloning vector comprising a circiilar or linear 
DNA molecule v^ch includes an appropriate replicon. In the present invention, 
the subcloning vector (segment D in Figure 1) can also contain fimctional and/or 
regulatory elements that are desired to be incorporated into the final product to 
act upon or with the cloned DNA Insert (segmental in Figure 1). The subcloning 
vector can also contain a Selectable marker (contained in segment C in Figure 1). 

Vector: is a DNA that provides a usefiil biological or biochemical 
property to an Insert Examples include plasmids, phages, and other DNA 
sequences which are able to replicate or be replicated in vitro or in a host cell, or 
to convey a desired DNA segment to a desired location^thin a host cell. A 
Vector can have one or more restriction endpnuclease recognition sites at which 
the DNA sequences can be cut in a . determinable fashion without loss of an 
essential biological fimction of the vector, and into which a DNA fiagment can 
be spliced in order to bring about its replicatipn and cloning. Vectors can further 
provide primer sites, eg:, for PGR, transcriptional and/or translational initiation 
and/or regulation sites, recombinational signals, replicons. Selectable markers, 
etc. Cleariy, methods of inserting a desired DNA fiagment vAAch do not require 
the use of homologous recombin£itipn or restriction en2ymes (such as, but not 
limited to, UDG cloning of PGR fragments (U.S. Patent No. 5334,575, entirely 
incorporated Irarein by reference), T:A cloning, and the like) can also be applied 
to clone a fi:agm»t of DNA into a cloning vector to be used according to the 
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present invention. The cloning vector can further contain a Selectable marker 
suitable for use in the identification of cells transformed with the cloning vector. 

Vector Donon is one of the two parental DNA molecules of the present 
invention vAnch carries the DNA segments encoding the DNA vector ^^di is to 
become part of the desired Product The Vector Donor comprises a subcloning 
vector D (or it can be called the cloning vector if the Insert Donor does not 
aheady contain a cloning vector) and a segment C flanked by recombination sites 
(see Figure 1). Segments C and/or D can contain elements that contribirte to 
selection for the desired Product daughter molecule, as described above for 
selection schemes. The recombination signals can be the same or difTerent, and 
can be acted iq>on by the same or different recombinases. In addition, the Vector 
Donor can be linear or circular. 

Description 

One general schraie for an in vitro or in vivo method of the invention is 
shown in Figure 1 , where the Insert Donor and the Vector Donor can be either 
circular or linear DNA, but is shown as circular. Vector 2> is exchanged for the 
original cloning vector A, It is desirable to select for the daughter vector 
containing elem^ts A and D and against other moleculos, including one or more 
Cointegrate(s). The square and circle are different sets of recombination sites 
(e.g., lox sites or att sites). Segment^ or D can contain at least one Selection 
Marker, e^qnression signals, origins of replication, or specialized functions for 
detecting, selecting, ^q>ressing, mapping or sequencing DNA, vdiere D is used 
in this ^cample. 

Examples of desired DNA segments that can be part of Element^ or I> 
include, but are not limited to, PGR products, large DNA segments, genomic 
clones or fiagments, cDNA clones, fiinctional elemoxts, era, and genes or partial 
genes, whidi encode useful nucleic acids or proteins. Moreover, tiie 
recombinational cloning of ibe present invention can be used to make ex vivo and 
in vivo gene transfer vehicles for protein e^q^ression and/or gene therapy. 
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In Figure 1, the scheme provides the desired Product as containing 
vectors D and as follows. The Insert Donor (containing A and B) is first 
recombined at the square recombination sites by recombination proteins, with the 
Vector Donor (containing C and D), to form a Co-integrate hjaving each of A-D- 
C-B. Next, recombination occurs at the circle recombination sites to form 
Product DNA {A and D) and Byproduct DNA (C and B). However, if desired, 
two or more different Co-integrates can be formed to generate two or more 
Products. 

In one embodiment of the present in vitro or in vivo recombinational 
cloning method, a method for selecting at least one desired Product DNA is 
provided. This can be understood by consideration of ttie map of plasmid 
pEZC726 depicted in Figure 2. The two exemplary recombination sites are attP 
and loxP. On one segment defined by these sites is a kanamycin resistance gene 
whose promoter has been replaced by the re/OP operator/promoter fi-om 
transposon 7>il 0. In the absence of tet repressor protein^ R coli RNA polymerase 
transcribes the kanamycin resistance gene firom the tetOP. If tet repressor is 
present, it binds to tetOP and blocks transcription of the kanamycin resistance 
gene. The other segment of pEZC726 has the /e/ repressor gene e3q)ressed by a 
constitutive promoter. Thus cells transformed by pEZC726 are resistant to 
chloramphenicol, because of the chloramphenicol acetyl transferase gene on the 
same segment as tetRy but arc sensitive to kanamycin. The recombination 
reactions result in separation of the tetR gene firom the regulated kanamycin 
resistance grae. This separation results in kanamycin resistance in cells receiving 
the desired recombination Product. 

Two different sets of plasmids were constructed to demonstrate the 
in vitro method. One set, for use with Crc recombinase only (cloning vector 602 
and subcloning vector 629 (Figure 3)) contained loxP and loxP 511 sites. A 
second set, for use with Crc and integrase (cloning vector 705 and subcl^ 
vector 726 (Figure 2)) contained loxP and att sites. The eflSdency of production 
of the desired daughter plasmid was about 60 fold higher using both enzymes 
than using Cre alone. Nineteen of twenty four colonies ftom the Crc-only 
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rcaction contained the desired product, while thirty eight of thirty eight colonies 
from the integrase pins Cre reaction contained the desired product plasmid. 

Other Selection Schemes A variety of selection schemes can he used that 
are known in the art as they can suit a particular purpose for i^iiich the 
5 recombinational cloning is carried out Depending upon individual prefi^ences 

and needs, a number of different types of selection schemes can be used in the 
recombinational cloning method of the presait invention. The skilled artisan can 
take advantage of the availability of the many DNA segments or methods for 
making them and the difT^nt methods of selection that are routinely used in th^ 

10 art Such DNA segments include but are not limited to those which encodes an 

activity such as, but not limited to, production of RNA, pq^tide, or protein, or 
providing a binding site for such RNA, peptide, or protein. Examples of DNA 
molecules used in devising a selection scheme are given above, under the 
definition of "selection scheme" 

15 Additional examples include but are not limited to: 

(i) Generation of new primer sites for PGR {e.g., juxt^sition of two 
DNA sequences that were not previously juxtaposed); 

(ii) Inclusion of a DNA sequence acted iq>on by a restriction 
endonuclease or oth^ DNA modifying enzyme, chemical, 

20 ribo^me, etc.; * 

(iii) Inclusion of a DNA sequence recognized by a DNA binding 
protein, RNA, DNA, chemical, etc,) (e.g:, for use as an aflBnity 

, tag for selecting for or excluding from a pbpulation) ^>avis, Nud, 
Acids Res. 24:1(^2-106 (1996); J. Virol 69: 8027-8034 (1995)); 

25 (Iv) /n vi/ro selection of RNA Ugands for the riboson^ 

associated with Epstein-Barr virus-e?q>ressed RNA by using 
randomized and cDNA-derived RNA libraries; 
^ (vi) The positipriirig of functional elements ^^iiose activity ^ 

specific orientation or juxtaposition {eig., (a) a recombination ^te 

30 : \^ch reacts poorly in trans, but when placed in cis, in the 

presence of the ^prop riate proteins, results m Tecnmhfpfltion that 
destroys certain populations of molecules; {eg., reconstitution of 
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a promoter sequence that aJIows in vitro RNA synthesis). The 
RNA can be used directly, or can be reverse transcribed to obtain 
the desired DNA construct; 

(vii) Selection of the desired product by size {e.g., fractionation) or 
5 otherphysicalproperty ofthe molecule(s); and 

(viii) Inclusion ofa DNA sequence required for a specific modification 
{e.g., methylation) that allows its identification. 

After formation of the Product and Byproduct in the method of the present 
invention, the selection st^ can be carried out either in vitro or in vivo depending 
10 upon the particular selection scheme which has been optionally devised in the 

particular recombinational cloning procedure. 

For example, an in vitro method of selection can be devised for the Insert 
Donor and Vector Donor DNA molecules. Such scheme can involve engineering 
a rare restriction site in the starting circular vectors in such a way that after the 
15 recombination events the rare cutting sites end iq) in the Byproduct HOTce,when 

the restriction enzyme viblch binds and cuts at the rare restriction site is added to 
the reaction mixture in vitro, all of the DNA molecules carrying the rare cutting 
site, /.e., the starting DNA molecules, the Cointegmte, and flie Byproduct, will be 
cut and rendered nonreplicable in the intended host cell. For example, cutting 
20 sites in segments and C (see Figure I) can be use* to select agafaist all 

molecules except the Product Alternatively, only a cutting site in C is needed if 
one is able to select for segmentZ>, e.^., by a drug resistance gene not found onB. 

Similarly, an in vitro selection method can be devised when dealing with 
linear DNA molecules. DNA sequences complemehtaiy to a PGR primer 
25 sequence can be so engineered that they are transferred, through the 

recombinational cloning method, only to the Product molecule.- After the 
reactions are completed, the ^spropriate primers are added to the reaction solution 
and the sample is subjected to PGR. Hence, all or part of the Product molecule 
is amplified. 

3^ Other in vivo selection schemes can be used with a variety of K coli cell 

lines. One is to put a repressor gene on one segment of the subcloning plasmid, 
and a drug niarker contn>Ued by that repressor on the other segment of the sam 
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plasmid. Another is to put a kUIer gene on segment C of the subcloning plasmid 
(Figure I). Of course a way must exist for growing such a plasmid, ic, there 
must exist circumstances under which the killer gene will not kill. There are a 
number of these genes known which require particular strmns of £. coli. One 
such schone is to use the restriction aayms Dprd, vMch will not cleave unless 
its recognition sequence GATC is methylated. Many popular common R coli 
strains methylate GATC sequences, but Acre arc mutants in which cloned Dpnl 
can be expressed without harm. 

Of course analogous selection schemes can be devised for other host 
organisms. For example, the tet repressor/operator otTnIO has been adapted to 
control gene expression in eukaiyotes (Gossen, M., and Bujard, H., Proc. Natl. 
Acad Sci USA 5P:5547-5551 (1992)). Thus the same control of drug resistance 
by the tet repressor exemplified hoein can be applied to select for Product in 
eukaiyotic cells. 

Recombination Proteins 

In the present invention, tlje exchange of DNA segments is achieved by 
the use of recombination proteins, including recombinases and associated 

co-fe<tors and iMX)teins. Yariom recombination proteins w described i^ 
Examples of such recombinases include: 

Oer A protein fiom bacteriophage PI (Abronski aiid Hoess, J. BioL 
Chenu 25P(3):1509-1514 (1984)) catalyzes the exchange (i e.. causes 
recombination) between 34 bp DNA sequences called laxP (locus of crossover) 
sites (&e Hoess et aL^ Nucl. Acids Res. 14(S):22S7 (1986)). Cre is available 
commerdaUyCNovagen, Catalog No. 69247-1). Recombination mediated by Cie 
is fieely reversible. From flieimodynamic considerations it is not sti^prising that 
Cre-mediated integration (recombination between two molecules to fonn one 
molecule) is mudi less efficient than Cre-mediated excision (recombination 
, betwcCT two fa«P sites in &e same molecule to fonn two daughter molecules^^ 
Cie worics in sinq>le buffos with eiflier magnesium or spomldine as a co&ctot, 
as is weU known in the art. The DNA substrates can be either linear or 
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supercoiled. A number of mutant loxP sites have been described (Hoess et al , 
stipra). One of these, loxP 511, recombines with another loxP 511 site, but will 
not recombine with a loxP site. 

Integrase: A protein from bacteriophage lambda that mediates the 
integration of the lambda genome into the E. coli chromosome. The 
bacteriophage X Int recombinational proteins promote irreversible recombination 
between its substrate att sites as part of the formation or induction of a lysogenic 
state. Reversibility of the recombination reactions results from two independent 
pathways for integrative and excisive recombination. Each pathway uses a 
unique, but overlapping, set of the 1 5 protein binding sites that comprise att site 
DNAs. Cooperative and competitive interactions involving four proteins (Int, 
Xis, IHF 9nd HS) determine the direction of recombination. 

Integrative recombination involves the Int and IHF proteins and sites attP 
(240 bp) and attB (25 bp). Recombination results in the formation of two new 
sites: and arfi2. Excisive recombination requires Int, IHF, and Xis, and sites 
ottL and attR to generate attP and attB. Under cotain conditions, FIS stimulates 
excisive recombination. In addition to these normal reactions, it should be 
appreciated that attP and attBy when placed on the same molecule, can promote 
excisive recombination to generate two excision products, one with attL and one 
with fltt/?. Similarly, intermolecular recombinatioiv between molecules 
containing attL and aUR^ in the presence of Int, IHF and Xis, can result in 
in/egra^e recombination and the generation a/rP and artS. Hence, by flanking 
DNA segments with appropriate combinations of engineered a// sites, in the 
presence of the appropriate recombination proteins, one can direct excisive or 
integrative recombination, as reverse reactions of each other. 

Each of the att sites contains a 15 bp core sequence; individual sequence 
elements of ftmctional significance lie within, outside, and across the boimdaries 
of this common core (Landy, A., Arm. Rev. Biochem. 58:913 (1989)). Efficient 
recombination between the various att sites requires that the sequence of the 
central cooomon region be identical between the lecombining partners, however, 
the exact sequence is now found to be modifiable. Cionsequently, derivatives of 
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the att site with changes within the core are now discovered to recombine as least 
as efficiently as the native core sequences. 

Integrase acts to recombine the attP site on bacteriophage lambda (about 
240 bp) with the attB site on the E. coU genome (about 25 bp) (Weisberg, RA, 
and Landy, A. in Lambda 11^ p. 21 1 (1983), Cold Spring Harbor Laboratory)), to 
produce the integrated lambda gienome flanked by attL (about 100 bp) and attR 
(about 160 bp) sites. In the absence of Xis (see below), this reaction is essentially 
irreversible. The integration reaction mediated by integrase and IHF works 
m vi/z-o, with simple buffer containing sperrnidine. Integrase can be obtained as 
described by Nash, ILA., Methods ofEmymology 100^10-216 (1983). IHF can 
be obtained as described by Fihitowicz, M., et al^ Gene 7-^7:149-150 (1994). 

In the presence of the X protein Xis (excise) integrase catalyzes the 
reaction of attR and attL to form attP and attB^ t e. , it promotes the reverse of the 
reaction described above. This reaction can also be applied in the present 
invention. 

Other Recombination Systems^ Numerous recombinatibn systems from 
various organisms can also be used> based on the teaching and guidance provided 
herein. See, e,g,, Hoess et aly Nucleic Acids Research 14(6):2297 (1986); 
Abremski et aly J, Biol ChenL261 (1)391 (1986); Campbell, J. Bacterial 
77^(23):7495 (1992); Qian et al,, J. Biol Chenu 257(H):7794 (1992); Araki 
et al, J. Mol Biol 2250):25 (1992)). Many of these belong to the integrase 
family of recombinases (Argos etal EMBOJ. 5:433-440 (1986)). Perii^s the 
best studied of these are the Integrase/a// system from bacteriophage X (Landy, 
A. (1993) Current Opinions in Genetics ond Devel 5:699-707), the CxdloxP 
system fix>m bac^^ophage PI (Hoess and Abrmiski (1990)' la Nucleic Acids and 
Molecular Biology^ voL 4. Eds.: Eckstein and Lilley, Berlin-Heidelberg: 
Springer-Veriag; pp, 90-109) , and the FLP/FRT syste^ fiom the Saccharomyces 
cerevisiae 2 fi circle plasmid (Broach e/ al. Cell 29^227-234 (1982)). 

Members of a second &inily of site-specific lecbmbinase^ the resolvase 
femily (e.g:, y6, Tn3 resolvase, HBn, Gin, and Cin) are also known. Menibers of 
this highly related family of recombinases are typically constrained to 
intramolecular reactions (e.g,^ inversions and excisions) and can require host- 
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encoded factors. Mutants have been isolated that relieve some of the 
requirements for host factors (Maeser and Katmmann (1991) Mol Gen, Genet. 
230:170-176), as well as some of the constraints of intramolecular recombination. 

Other site-specific recombinases similar to A Int and similar to P 1 Cre can 
be substituted for Int and Cre. Such recombinases are knovm. In many cases the 
purification of such other recombiilases has been described in the art. In cases 
when they are not known, cell extracts can be used or the enzymes can be 
partially purified using procedures described for Cre and Int. 

While Cre and Int are described in detail for reasons of example, many 
related recombioase systems exist and their application to the described invention 
is also provided according to the present invration. The integrase family of site-^ 
specific recombinases can be used to provide altonative recombination proteins 
and recombination sites for the present invention, as site-specific recombination 
proteins encoded by bacteriophage lambda, phi 80, P22,P2, 186, P4 and PL This 
group of proteins exhibits an imexpectedly large diversity of sequences. Despite 
this diversity, all of the recombinases can be aligned in their C-terminal halves. 

A 40-residue region near the C terminus is particularly well conserved in 
all the firoteins and is homologous to a region near the C terminus of the yeast 2 
mu plasmid Flp protein. Three positions are perfectly conserved within this 
family: histidine, arginine and tyrosine are found at ^spective alignment 
positions 396, 399 and 433 within the well-consa-ved C-terminal region. These 
residues contribute to the active site of this family of recombinases, and suggest 
that tyrosine-433 forms a transient covalent linkage to DNA during strand 
cleavage and rejoining. See, e.g., Argos, P. et al., EMBO J. 5:433-40 (1986). 

Alternatively, IS231 and other Bacillus thuringiensis transi>osable 
elements could be used as recombination proteins and recombination sites. 
Bacillus thuringiensis is an entomopathogenic bacterium vdiose toxicity is due 
to the presence in the sporangia of deha-endotoxin crystals active against 
agricultural pests and vectors of human and animal diseases. Most of the genes 
coding for these toxin proteins are plasmid-bome and are generally structurally 
associated with msertion sequences 0S231,IS232, IS240,ISBT1 andISBT2)and 
transposons (Tn4430 and Tn5401). Several of these mobile elements have been 
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shown to be active and participate in the crystal gene mobility, thereby 
contributing to the variation of bactmal toxicity. 

Structural analyds of the iso-IS23 1 clients indicates that they are related 
to ISl 151 from Clostridium perfringens and distantly related to IS4 and IS186 
from Escherichia coli. Like the other IS4 family members, they contain a 
cons^r\^ed transposase-integrase motif found in otfier IS femilies and retroviruses. 

Moreover, functional data gathered from IS231A in Escherichia coli 
indicate a non-replicative mode of transposition, with a preference for specific 
targets. Similar results were also obtained in Bacillus subtUis and B, 
thuringiensis. See, e.g,, Mahillon, J. et aL, Gemtica P3:13-26 (1994); Canq)bell, 
J Bacterial 7495-7499 (1992). 

The amount of recombinase wiiich is added to drive the recombination 
reaction can be detamined by iisingknoAvn assays. Specifically, titration assay 
is used to detoinine the ^>propriate amoimt of a purified recombinase enzyme, 
or the ap^opriate amount of an extract 

Engineered Recombination Sites. The above recombinases and 
corresponding recombinase sites are suitable for xise in recombination cloning 
according to the present invention. Howcvct, wild-type recombination sites 
contain sequences that reduce the efiSciency or specifidty of recombination 
reactions as ^plied in methods of tiie present invention. «For example, ihultiple 
stop codons in attB, attR, attP, attL and loxP recombinatic»i sites occur in 
multiple, reading frames on both strands, so recombination efiBciencies are 
reducted, e.g,, where the coding sequence must cross the recombination sites, 
(only one reading frame is available on each strand of loxP and attB sites) or 
impossible (in attP, attR or attL)^ 

Accordingly, the preset invention also provides engineered 
recombination sites tiiat ov»x:ome these problems. For example^ att sites can be 
engineered to have one or multiple mutations to enhance specificity or efficiency 
of the recombination reaction and the properties of Product DNAs (e,g^ , att 1 , att2, 
and att3 ates); to dea:ease reverse reaction (e.^, removing PI and HI from atffi). 
The testing of these mutants determines which mutants yield sufficient 



wo 96/40724 PCT/US96/10082 

-27- 

recombinatioiia] activity to be suitable for recombination subcloning according 
to the present invention. 

Mutations can therefore be introduced into recombination sites for 
enhancing site specific recombination. Such mutations include, but are not 
limited to: recombination sites without translation stop codons that allow fusion 
proteins to be encoded; recombination sites recognized by the same proteins but 
differing in base sequence such that they react largely or exclusively with their 
homologous partners allow multiple reactions to be contemplated. Which 
particular reactions take place can be specified by which particular partners are 
present in the reaction mixture. For example, a tripartite protein fu^on could be 
accomplished with parental plasmids containing recombination sites attRl and 
attR2; attLl and attL3; and/or attR3 and attL2. 

There are well known procedures for introducing specific mutations into 
nucleic acid sequences. A number of these are described in Ausubel, RM. et aL^ 
Current Protocols in Molecular Biology, Wiley Interscience, New York (1989- 
1 996), Mutations can be designed into oligonucleotideis, vrfiich can be used to 
modify existing cloned sequences, or in amplification reactions. Random 
mutagenesis can also be employed if appropriate selection methods are available 
to isolate the desired mutant DNA or RNA The presence of the desired 
mutations can be confirmed by sequencing the nucleic%cid by well , known 
methods. " 

The following non-limiting methods can be used to engineer a core region 
of a given recombination site to provide mutated sites suitable for use in the 
present invention: 

1 . By recombination of two parmtal DNA sequences by site^^pecific (e.g. 

attL and attR to give attB) or other (e.g, hbmologous) 
recombination mechanisms. The DNA parental DNA segments 
containing one or more base alterations resulting in the final core 
sequence; 

2. By miitation or mutagCTesis (site-specific, PCR, randoni, spontaneous, 

etc) directly of the desired core seqiience; 



1 
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3. By mutagenesis (site-specific, PGR, random, spontanteous, etc) of 

parental DNA sequences, \^ch are recombined to generate a 
desired core sequence; and 

4. By reverse transcription of an RNA encoding the desired core 
5 sequence. 

The functionality of the mutant recombination sites can be demonstrated 
in v^ys that depend on the particular diaiacteristic that is desired. For example, 
the lack of translation stop codons in a recombination site can be demonstrated 
by e3q)ressing the ^projiriate fusion proteins. Specificity of recombination 

10 betweoi homologous partners can be demonstrated by introducing the appr^ 

molecules into in vitro reactions, and assaying for recombination products as 
described herein or known in the art Other desired mutations in recombination 
sites might include the presence or absence of restriction sites, translation or 
tiansOTption start signals, protein binding sites, and other known functionalities 

15 of nucleic acid base sequences. Genetic selection schemes for particular 

fimctional attributes in the recombination sites can be used according to kn^ 
method steps. For example, fhe modification of sites to provide (from a pair of 
sites that do not interact) partn^ that do interact could be achieved by requiring 
deletion, via recombination between the sites, of a DNA sequence encoding a 

20 toxic substance. Similarly, selection for sites that lemove translation stop 

sequences, the presence or absence of protein binding sites, etc., can be easily 
devised by those skilled in the art 

Accordingly, the present invention provides a nucleic acid molecule, 
comprising at least one DNA segment having at least two engineered 

25 recoiribination sites fiankiiig a Selectable inaric^and/o^ 

wherein at least one of said recombination sites comprises a core region having 
at least one engineered mutatioii that enhances recombination m vitro in the 
formation of a Cointegrate DNA or a Product DNA 

The nucldc a$^d molecule can have at least one mutation that confers at 

30 least one rahancement of said recombination, said enhancCToent selected from the 

groiq> consisting of substantially ficvoring excisive integration; (jS) &voring 
exciave recombination; ^) relieving the requirement for host factors; QSSl) 
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increasing the efficiency of said Cointegrate DNA or Product DNA formation; 
and (iv) increasing the specificity of S2ud Cointegrate DNA or Product DNA 
foimation. 

The nucleic acid molecule preferably comprises at least one 
5 recombination site derived fix)m attB,attP,attL or attR. More preferably the att 

site is selected firom attl, att2, or att3, as described herein. 

In a preferred embodiment, the core region comprises a DNA sequence 
selected firom the group consisting of: 

(a) RKYCWGCTTTYKTRTACNAASTSGB (m-att) (SEQ ID NO:l); 
10 (b) AGCCWGCTTTYKTRTAGNAACTSGB (m-attB) (SEQ ID 

NO:2); 

(c) GTTCAGCTTTCKTRTACNAACTSGB (m-attR) (SEQ ID 

NO:3); 

(d) AGCCWGCTTTCKTRTACNAAGTSGB (m-attL) (SEQ ID 

15 NO:4); 

(e) GTTCAGCTTTYKTRTACNAAGTSGB(m-attPl) (SEQ ID 

NO:5); 

or a coiresponding or complementary DNA or RNA sequence, wherein R=A or 
G; K=G or T/U; Y=C or T/U; W=A or T/U; N=A or C or G or T/U; S=Cor G; 
20 and B?=C or G or T/U, as presented in 37 C.RR. §1.8f2, which is entirely 

incorporated herein by refetmce, wliaein the core region does not contain a stop 
codon in one or more reading firames. 

The core region also preferably comprises a DNA sequence selected fix)m 
; Re group consisting of: 

25 (a). AGCCTGCiiriTlGTACAAACTTGT(attBl)(SEQIDNO:6) 

(b) AGCCTGGTlTCTTGTACAAACTTGT(attB2)(SE<3IDNO:7); 

(c) ACCCAGClllUllGTACAAAeTTGT (attBS) (SEQ IDNO:8): 

(d) GTrCAGCTltTTTGTACAAACTrGT(attJ.l)(SEQIDNO:9) 

(e) GTrCAGCHTUlTGTACAAACTTGT(allR!2)(SEQIDNO:10): 
30 if) GTTCAGCTTTCTtGTACAAAGTTGG (attRB) (SEQ ID 

NO: 11); 
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(g) AGCCTGCTTTTTTGTACAAAGTTGG (attLl) (SEQ ID 
NO:12); 

(h) AGCCTGCTTTCTTGTACAAAGTTGG (attLl) (SEQ ID 
NO:13); 

5 CO ACCCAGCTTTCTTGTACAAAGTTGG (attLB) (SEQ ID 

NO:14); 

(j) GTTCAGCniil"lGTACAAAGTTGG(attPl) (SEQ IDNO:15); 
(k) GTTCAGCTTTCTTGTACAAAGTTGG (attP2J>3) (SEQ ID 
NO: 16); or a coiresponding or complementary DNA or RNA sequence, 

10 The present invention thus also provides a method for making a nucleic 

acid molecule, comprising providing a nucleic acid molecule having at least one 
engineered recombination site comprising at least one DNA sequence having at 
least 80-99% homology (or any range or value therein) to at least one of SEQ ID 
NOS:l-16, or any suitable recombination site, or which hybridizes under 

IS stringent conditions thereto, as known in the art 

Clearly, there are various t3rpes and permutations of such well-known 
in vitro and in vivo selection methods, each of which are not described herein for 
the sake of brevity. However, such variations and permutations are contemplated 
and considered to be the different embodiments of the present invention. 

20 It is important to note that as a result of the prefetred embodiment being 

invitro recombination reactions, non-biological molecules such as PGR products 
can be manipulated via the present recombinational cloning method. In one 
example, it is possible to clone linear molecules into circular vectors. 

There are a number of ^plications for the preset inventionu These uses 

25 include, but are not limited to, changing vectors, iqypoising promoters with genes, 

constructing genes for fusion proteins, changing copy numbCT, changing 
r^Iicons, cloning into phages, and cloning, e,g. , PGR products (with an ottB site 
at one end and a lopcP site at the other end), genomic DNAs, and cDNAs. 
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The foUowing examples are intended to further illustmte 
embodiments of the invention and are not intended to be limiting in nature. 

Examples 

The present recombinational cloning me&od accomplishes the exchange 
of nucleic acid segments to render something usefiil to the user, such as a change 
of cloning vectors. These segments must be flanked on both sides by 
recombination signals that are in the proper orientation with respect to one 
another. In the examples below the two parental nucleic acid molecules {e.g,^ 
plasmids) are called the Insert Donor and the Vector Donor. The Insert Donor 
contains a segment that will become joined to a new vector contributed by the 
Vector Donor. The recombination intermediate(s) that contain(s) both starting 
molecules is called the Cointegrate(s). The second recombination event produces 
two daughter molecules, called the Product (the desired new clone) and the 
Byproduct 

Buffers 

Various known buffers can be used in the reactions of the present 
invention. For restriction . enzymes, it is advisable to use the buffers 
recommended by the manufacturer. Alternative buffers can be readily found in 
the literature or can be devised by fliose of ordinary skill in the art 

Examples 1-3. One exemplary buffer for lambda integrase is comprised 
of 50 mM tris-HCl, at pH 7.5-7.8, 70 mM KCl, 5 mM speraiidine, 0.5 mM 
EDTA, and OSLS mg/ml bovine serum albumin, and optibnally, 10% glycerol. 

One preferred buffer for PI Cre recoxhbinase is comprised of 50 mM 
Tris-HCl at pH 7.5, 33 mM NaCl, 5 mM spermidine, and 0.5 mg/ml bovine 
serum albtimin. 

The buffer for other site-specific recombinases \\^ch are similar to 
lambda Int and PI Cre are either known the art or can be detemiined 



wo 96/40724 PCTAIS96/1 0082 

-32- 

empirically by the skilled artisans, particularly in light of the above-described 
bnfiFers. 

Example 1: Recombinational Cloning Using Cre and Cre Ji Jnt 

Two pairs of plasmids were constructed to do the in vitro recombinational 
cloning method in two different ways. One pair, pEZC705 and pEZC726 
(Figure 2 A), was constructed with loxP and att sites, to be used with Cre and 
A integrase. The oflier pair, pEZC6p2 and pEZC629 (Figure 3A), contained the 
/oxP (wild type) site for Cre, and a second mutant /ax site, loxF 5J1^ which 
differs from loxP in one base (out of 34 total). The minimum requirement for 
recombinational cloning of the present invention is two recombination sites in 
each plasmid, in gCTieral Xand F, and X' and Y. Recombinational cloning takes 
place if either or both types of site can recombine to form a Cointegrate {e.g. X 
and X\ and if either or both (but necessarily a site different from the type 
forming the Cointegrate) can recombine to excise the Product and Byproduct 
plasmids from the Cointegrate (e.g: Y and F). It is important Aat the 
recombination sites on the same plasmid do not recombine. It was found that the 
present recombinational cloning could be done with Cre alone. 

0 

Cre-Onfy 

Two plasmids were constructed to demonstrate this conception (see 
Figure 3A). pEZC629 was the Vector Donor plasmid. It contained a constitutive 
drug marker (chloramphenicol resistance), an origin of replication, loxP and 
loxP 511 sites, a conditional drug marker (kanamycin resistance i^diose 
e^qiression is controlled by the operator/prompt^.of the tetracycline resistance 
opm>n of transposon TnlO), and a constitutively e}q>ressed gene for the tet 
repressor protein, tetR. K coli cells containing pEZC629 were resistant to 
chloramphenicol at 30 ^g^tnl, but soii^itiye to kstnamycin at 100 ^g^. pEZC6Q2 
was the Insert DorK>r plasmid, \^ch c(nitai^ a different drug marker 
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(ampicaiin resistance), an origin, and loxP and loxP 511 sites flanking a miiltiple 
cloning site. 

This experiment was comprised of two parts as follows: 
Parti: About 75 ng each of pEZC602 and pEZC629 were mixed in a 
total volume of 30 pi of Cre buffer (50 mM Tris-HCI pH 7.5, 33 mM NaCl, 
5 mM speamidine-HCl, 500 ^g/ml bovine serum albumin). Two 10 jil aliquots 
were transferred to new tubes. One tube received 0.5 \i\ of Cre protein (approx. 
4 units per ^1; partially purified according to Abremski and Hoess, J. Biol Chem. 
259: 1 509 (1984)). Both tubes were incubated at 3rC for 30 minutes, then 70"C 
for 10 minutes. Aliquots of each reaction were diluted and transformed into 
DH5a. Following expresision, aliquots were plated on 30 jig/ml 
chloramphenicol; 100 ^g/ml ampidllin plus 200 ^g/ml metibicillin; or 100 pg/ml 
kanamycm. Results: See Table 1. The reaction without Cre gave 1.11x10** 
ampidllin re^stant colonies (fiom the Insert Donor plasmid pEZC602); 7.8x10^ 
chloramphenicol resistant colonies (from the Vector Donor plasmid pEZC629); 
and 140 kanamycin resistant colonies (background). The reaction with added Cre 
gave 7.5x10^ ampicillin resistant colonies (ftom the Insert Donor plasmid 
pEZC602); 6,1x10^ chloramphenicol resistant colonies (from the Vector Donor 
plasmid pEZC629); and 760 kanamyciil resistant colonies (mixture of 
background colonies and colonies from the recombiiiatiSnal cloning Product 
plasmid). Analysis: Because the number of colonies on the kanamycin plates 
was much higher in the preseiice of Cre, many or most of th^ were predicted to 
contain the desired Product plasmid. 



Table 1 


Enzyme 


Ampicillin 


Chlorami^enico] 


Kanamycin 


EfQciency 


None 


1.1x10* 


7.8x10^ 


140 


140/7.8x10^ 0.02% 


Ore 


7.5x10* 


6Jxl0* 


760 


760/6.1x10^0.12% 



Partn: Twenty four colonies from the **+ Cre** tonamycin plates wore 
picked and inoculated into medium containing 1 00 ^g/ml kanamycin. Minipreps 
were done, and the minipr^ DNAs, uncut or cut with Smal or /ffndlll, were 
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electrophoresed. Results: 19 of the 24 minipreps showed supercoilcd plasmid 
of the size predicted for the Product plasmid. All 1 9 showed the predicted Smal 
and Hindni restriction fragments. Analysis: The Cre only scheme was 
demonstrated. Specifically, it was deteraiined to have yielded about 70% (19 of 
24) Product clones. The eflSciency was about 0.1% (760 kanamycin resistant 
clones resulted from 6. 1 xl 0^ chloramphenicol resistant colonies). 

Cre Plus Integrase 

The plasmids used to demonstrate this mcdiod are exactly analogous to 
those used above, except that pEZC726, the Vector Donor plasmid, contained an 
attP site in place of loicP 51 1^ and pEZC705, the Insert Donor plasmid, contained 
an attB site in place of loxP 511 (Figure 2A). 

This experiment was comprised of three parts as follows: 
Part J: About 500 ng of pEZC705 (the Inscat Donor plasmid) was cut 
with Seal, vAnxii linearized the plasmid within the ampicillin resistance gene. 
(This was don^e because the A integrase reaction has been historically done with 
the attB plasmid in a linear state (TH. Nash, personal communication). However, 
it was found later that flie integrase reaction proceeds well with both plasmids 
stq)eTcoiled,) Then, the linear plasmid was edianol precipitated and dissolved in 
20 ^1 ofX integrase buffer (50 mM Tris-HCl, about pH 7.8, 70 mM KCl, 5 mM 
spermidine-HCl, 0.5 mM EDTA, 250 ^g/ml bovine serum albumin). Also, about 
500 ng of the Vector Donor plasmid pEZC726 was ethanol precipitated and 
dissolved in 20 ^ X integrase buffer. Just before use, X integrase (2 pi, 
393 pg/ml) was thawed and diluted by adding 18 iil cold X integrase buffer. 
One pi IHF (integration host &ctor, Z4 mg/ml, an accessory protein) was diluted 
into 150 nl cold X integrase buffer. Aliquots (2 pi) of each DNA were mixed 
with A integrase buffer, with or without 1 pi eac^ Jl integrase and IHF, in a total 
of 10 pL The mixture was incubated at 25X for 45 minute then at 70*C for 
1 0 minutes. Half of eadi reaction was applied to an agarose geL Results: In the 
presence of integrase and IHF, about 5% of the total DNA was converted to a 
linear Cointegiate form. Analysis: Activity of integrase and IMF was confirmed. 
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Part II: Three microKters of each reaction (/.e., with or without integrase 
and IHF) were diluted into 27 \i\ of Cre buffer (above), then each reaction was 
split into two 10 \x\ aliquots (four altogether). To two of these reactions, 0.5 \i\ 
of Cre protein (above) were added, and all reactions were incubated at 3TC for 
5 30 minutes, then at 70*^0 for 1 0 minutes. TE buffer (90 \x\\ TE: 1 0 mM Tris-HCI, 

pH 7.5, 1 mM EDTA) was added to each reaction, and 1 ^l each was transformed 
into E. coli DH5a. The transformation mixtures were plated on 100 ixg/nil 
ampicillin phis 200 ng/ml methicillm; 30 jxg/ml chloramphenicol; or 100 ^g/ml 
kanamycin. Results: iS'eeTable2. 



Table 2 


Enzyme 


Ampicillin 


Chloramphenicol 


Kanamycin 


Efficiency 


None 


990 


20000 


4 


4/ 2x10* = 0.02% 


Cre only 


280 


3640 


0 


0 


Integrase* 
only 


1040 


27000 


9 


9/ 2.7x10* = 0.03% 


Integrase* 
+ Cre 


110 


1110 


76 


76/ 1.1x10' = 6.9% 



♦ Integrase reactions also contained HF. 



Anafysis: The Cre protein impaired transformation. When adjusted for 
20 this effect, the ntmber of kanamycin resistant colonies, compared to the 

reactions, increased more than 100 fold when both Cre and Integrase were used. 
This suggests a specificity of greater than 99%. 

Pari III: 38 colonies were picked from the Integrase plus Cre plates, 
miniprep DNAs were made and cut with Hin6ni to give diagnostic mapping 
25 information. Result: All 38 had precisely the expected fragment sizes. 

Anafysis: The Cre plu^ A intejgrase method was observed to have much higher 
specificity than Cre-alone. Conclusion: The Cre plus k integrase method was 
demonstrated. EflBciency and specificity were much higher than for Cre only. 
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Example 2: Using in vitro Recombinational Cloning to Subclone the 
Chloramphenicol Acetyl Transferase Gene into a Vector for Egression in 
Eukaryotic Cells (Figure 4A) 

An Insert Donor plasmid, pEZC843, was constructed, comprising the 
chloramphenicol acetyl transferase gene of £1 co//, cloned betwem loxP and cOtB 
sites such that tiie loxP site was positioned at the 5 '-end of the gene (Figure 4B). 
A Vector Donor plasmid, pEZClOOS, was constructed, which contained the 
cytomegalovirus eukaiyotic promoter ^posed to a loxP site (Figure 4C). One 
microliter aliquots of each supercoiled plasmid (about 50 ng crude miniprep 
DNA) were combined in a ten microliter reaction containing equal parts of 
lambda integrase buffer (50 mM Tris-HCl, pH 7.8, 70 mM KCl, 5 mM 
spermidine, 0.5 mM EDTA, 0:25 mg/ml bovine serum albumin) and Cre 
recombinase buffer (50 mM Tris-HQ, pH 7.5, 33 mMNaCl, 5 mM spemiidine, 
0.5 mg/ml bovine serum albumin), two units of Cre recombinase, 16 ng 
integration host fector, and 32 ng lambda integrase. After incubation at 30'C for 
30 minutes and 75*C for 10 minutes, one microliter was transformed into 
competent coli strain DH5a (Life Technologies, Inc.). Aliquots of 
transformations were spread on agar plates containing 200 ^gAnl kanamy cin and 
incubated at 37*C overnight An otherwise identical control reaction contained 
the Vector Donor plasmid only. The plate receiving 10?^of the control reaction 
transfonnation gave one colony; the plate receiving 10% of the recombinational 
cloning reaction gave 144 colonies. These numbers suggested that greater than 
99% of the recoinbinatioii;^ cloning colonies contained the desired product 
plaanid. Miniprep DNA made fiom six recombinational cloning colonies gave 
the predicted size plasmid (5026 base pairs), CMVPnxL Restriction digestion 
wifli Afcol gave the fiagments predicted for Ac chloramphoiicol acetyl transferase 
clorted downstream of the CMV promoter for all six plasnids. 
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Example 3: SubclonedDNA Segments Flanked by attB Sites WUhout Stop 
Codons 



Parti: Background 

The above examples are suitable for transcriptional fiisions, in which 
transcription crosses recombination sites. However, both attR and loxP sites 
contain multiple stop codons on both strands, so translational fusions can be 
difBcuIt, v^diere the coding sequence must cross the recombination sites, (only one 
reading fiame is available on each strand of loxP sites) or impossible (in attR or 
attL), 

A principal reason for subcloning is to fuse protein domains. For 
example, fiision of the glutathione S-transferase (GST) domain to a protein of 
interest allows the fusion protein to be purified by affinity chromatography on 
glutathione agarose (Pharmacia, Inc., 1995 catalog). If the protem of interest is 
fused to runs of consecutive histidines (for example His6), the fusion protein can 
be purified by affinity chromatography on chelating resins containing metal ions 
(Qiagen, Inc.). It is often desirable to compare amino terminal and carboxy 
terminal fusions for activity, solubility, stability, and the like. 

The attB sites of the bacteriophage X integration s^em were examined 
as an alternative to loxP sites, because thqr are small (25 bp) and have some 
sequence flexibility (Nash, HA. et aL, Proc Natl, Acad ScL USA 5^:4049-4053 
(1987). It was not previously suggested that multiple mutations to remove aU 
stop codes would result in useful recombination sites for recombinational 
subcloning. 

Using standard nomenclature for site specific recombination in lambda 
bacteriophage (Weisber, mZ^3tiw6db /ZT, Hendrix, ^i/., eds.. Cold Spring Harbor 
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Laboratory, Cold Spring Harbor, NY (1989)), the nucleotide regions that 
particii>ate in the recombination reaction in an £ coli host cell are represented 
as follows: 

attP — PI — HI- -P2 — X- -H2 - -C-O-C — H • — P • 1 — P • 2 — P • 3 - - 

+ 

attB --B-0-B'-- 

Int, IHF It Xis, Int, IHF 

attR — P1~H1 — P2--X--H2--C-0-B* — 

attX --B-0-C— H' — P'l--P«2 — P'3--/ 



where: O represents the 15 bp core DNA sequence found in both the phage and 
K coli genomes; B and B' represent approximately 5 bases adjacent to the core 
in the R coli genome; and PI, HI, P2, X, H2, C, C, IT, Fl, F2, and P3 represent 
known DNA sequences encoding protein binding domains in the bacteriophage 
Agenome,- 

The reaction is rev^sible in the presence of the protein Xis (exdsionase); 
recombination between atlL and attR precisely excise ^ k genome fiom its 
integrated st^e^ regenerating the circular X ge^ 
-E cp// genome containing attB^ 

Part n: Comtrucdon and Testing of Plasmids Containing Mutant att Sites 

Mutant altL. and attR sites were constnicte^ Importantly, Landy e/ai 
(Amt Jtev, Biochem. 58:913 (1989)) observed that deletion of the PI and HI 
domains of attP facilitated the excision reaction and eliminated the integration 
reaction, Aereby making the excision reaction inevendble. Therefore, as 
mutations were introduced in attR, the PI and HI domams were also deleted, 
attR sites in the present example lack the PI and HI regions and have the Ndel 
site removed (base 27630 changed fiom C to G), and contain sequences 



wo 96/40724 PCT/US96/10082 

-39- 

corresponding to bacteriophage X coordinates 27619-27738 (GenBank release 
92.0, bgiLAMCG, "Complete Sequence of Bacteriophage Lambda"). 

The sequence of attB produced by recombination of wild type attL and 
attR sites is: 



B O B* 

5 attBwt: 5' AGCCT GCTTTTTTATACJt&a CTnaS 3» (SEQ. ID 

NO:31) 

3' TCGGA CGAAA AAA7ATGA7T GAACT 5» * 

The stopcodons are italicized and undeilined. Note that sequences of attL, attR, 
and attP can be derived from the attB sequence and the boundaries of 
bacteriophage X contained within attL and attR (coordinates 2761 9 to 278 1 8). 
When mutant attRl and attLl sites were recombined the sequence attBl 
10 was produced (mutations in bold, large font): 



^"Bl: 5. AGCCT gcttTTTtGtacI^ CTTGT 3 ^^^^^ IDNO:€) 

t 

3 • TCGGA cGAAAAAaCaTGTtT GJ^cA 5 • 

Note that the four stop codons are gone* 

When an additional mutation was introduced in the attRl and attLl 
sequences (bold), attR2 and attL2 sites resulted. Recombination of attR2 and 
15 attL2 produced the attB2 site: 

B O B' 

attB2: 5- 'aGCCT gCTTTCtTGIACAAA ^ <^ 3' (SEQ. IDN0:7} 

^' ^^^^ cgaaaGaacatgttt 5' 

The recombination activities of the above attL and attR sites wereassayed 
as follows. The attB site of plasmid pEZC705 (Figure 2B) was replaced with 
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attLwt, attLl, or attL2. The attP site of plasmid pEZC726 (Figure 2C) was 
replaced with attRwt (lacking regions PI and HIX attRl, or attR2. Thus, the 
resulting plasmids could repombine via their loxP sites, mediated by Cre, and via 
their attR and attL sites, mediated by Int, Xis, and IHF. Pairs of plasmids were 
mixed and reacted with Cre, Int, Xis, and IHF, transformed into K coli competent 
cells, and plated on agar containing kanamycin. The results are presented in 
Tables: 



Tables 



Vector donor att site 


G^e donor att she 


# of kanamycin resistant colonies* 


at£Rwt(pEZC130I) 


N(»ie 


1 (background) 


tt 


attLwt(pEZC1313) 


147 


n 


attLl (pEZC13 17) 


47 


n 


attL2 (pEZC132I) 


0 


attRl (pEZC1305) 


None 


1 (bacl^round) 


W 


atflLwtO>EZC1313) 


4 


tf 


attLl (pEZC1317) 


128 


M 


attL2 CpEZC1321) 


0 


atlR2(pEZC1309) 


None 


9 (background) 


tt 


attLwt(pEZC1313) 


0 


If 


attL2(pEZC1317) 


0 


tt 


attL2 (pEZC1321) 


209 



(* 1% of each transformation was spread on a kanamycin pl&.) 



Hie above data show that \^^eas tibe wild type att and attl sites 
recombine to a small extent, the attl and att2 sites do not recombine detectably 
with each other, 

PariSL Recombination was demonstrated \^^a the core region of both 
attB sites flanking the DNA segment of interest did not contain stop cbdons. The 
physical state of the participating plaids was discovered to influence 
recombination c&ciency. 

The ^>propriate att ates were moved into pEZC705 and pEZC726 to 
make the plasmids pEZC1405 (Rgure 5G) (attRl and attR2) and pEZC1502 
(Hgure 5H) (attLl and attl^). The dedred DNA segment in diis e)q)erin3e^^ 
a copy of file chloranq)henicol resistance gene doned between the two attL sites 
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of pEZC1502. Pairs of plasmids were recombined in vitro using Int, Xis, and 
IHF (no Cre because no loxP sites were present). The yield of desired kanamycin 
resistant colonies was detemiined "when both parental plasnuds were circular, or 
when one plasmid was circular and the other linear as presented in Table 4: 



Table 4 



Vector donor' 


Gene donor' 


Kanamycin resistant colonies^ 


Circular pEZC 1405 


None 


30 


Circular pEZC1405 


Circular pEZC 1502 


2680 


Linear pEZCMOS 


None 


90 


Linear pEZCMOS 


Circular pEZCl 502 


172000 


Circular pEZCHOS 


Linear pEZCl 502 


73000 



• DNAs were purified with Qiagen columns, concentrations determined by 
A260, and linearized with Xba I (pEZCMOS) or AlwN I (pEZC1502) Each 
reaction contained 100 ng of the indicated DNA. All reactions (10 pi total) 
contained 3 |il of ^izyme mix (Xis, Int, and IHF). After incubation (45 minutes 
at 25**, 10 mmutes at 65*), one pi was used to transform E, coli DH5a cells. 
^ Number of colonies expected if the entire transformation reaction (1 ml) had been 
plated. Either 100 pi or 1 pi of the transformations were actually plated. 

Analysis: Recombinational cloning using mutant attR and attL sites was 
confirmed. The desired DNA segment is subcloned between attB sites that do not 
contain any stop codons in either strand. The enhanced 3deld of Product DNA 
(when one parent was Imear) was unexpected because of earlier observations that 
the excision reaction was more efBcient when both participating molecules were 
supercoiled and proteins were limiting (Nunes-Duby e/ a/.. Cell 50:779-788 
(1987). 

Example 4: Demonstration of Recombinational Cloning Without Inverted 
Repeats 



Part i: Rationale 



The above Example 3 showed that plasmids containing inverted repeats 
of the appropriate recombination sites (for example, attLl and attL2 in plasmid 
pEZC1502) (Figure 5H) could recombine to give the desired DNA segment 



wo 96/40724 PCTAJS96/1 0082 

-42- 

flanked by attB sites without stop codons, also in inverted orientation. A concern 
was the in vivo and in vitro influence of the inverted repeats. For example, 
transcription of a desired DNA segment flanked by attB sites in inverted 
orientation could yield a single stranded RNA molecule that might fomi a haiipin 
structure, thereby inhibiting translation. 

Inverted orientation of similar recombination sites can be avoided by 
placing the sites in direct repeat arrangement att sites. If parental plasmids each 
have a wild type attL and wild type^ attR site, in direct repeat the Int, Xjs, and IHF 
proteins will simply remove the DNA segment flanked by those sites in an 
intramolecular reaction. HowevCT, the mutant sites described in the above 
Example 3 suggested fliat it might be possible to inhibit the intramolecular 
reaction while allowing the intermolecular recombination to proceed as desired. 

PartJI: Structure of Plasmids WUhoui Inverted Repeats f^^^ 
Cloning 

The attR2 sequence in plasmid pEZC1405 (Figure 5G) was replaced with 
attL2, in the opposite orientation, to make pEZC1603 (Figure 6A). The attL2 
sequence of pEZC1502 (Figure 5H) was replaced with attR2, in the opposite 
orientation, to make pEZC1766 (Figure 6B). Each of tlnpse plasmids contamed 
mutations in the core region that make intramolecular reactions b^ween attl and 
att2 cores veiy inefficient (see Example 3, above). 

Plasmids pEZC1405, pEZC1502, pEZG1603 and pEZC1706 were 
purified on Qiagen columns (Qiagen, Inc.). Aliquots of plasmids pEZC1405 and 
pEZC1603 were linearized with Xba I. Aliquots of plasmids pEZC 1502 and 
pEZC1706 were linearized with AlwN I. One hundred ng of plaids were 
mbced in buffer (equal volumes of 50 Tris HQ pH 7^, 25 mM Tris HCl pH 
8.0, 70 mM KCl, 5 inM spermidine, 0.5 mM EDTA, 250^g^ BSA, 10% 
glycerol) containing Int (43 .5 ng), Xis (4 J ng) and IHF (8. 1 ng) in a final volume 
of 10 fd. Reactions were incubated for 45 minutes at 25^*0, 10 minutes at 65°C 
and 1 jil was transformed into K coli DH5a. After expression, aliquots were 
spread on agar plates containing 200 ^g^ml kanamycin arui incubated at 37**C. 
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Results, expressed as the number of colonies per 1 \x\ of recombination 
reaction are presented in Table 5: 



Tables 



Vector Donor 


Gene Donor 


Colonies 


Predicted % product 


Circular 1405 




100 




Circular 1405 


Circular 1502 


3740 


3640/3740 = 97% 


Linear 1405 




90 




Linear 1405 


Circular 1502 


172,000 


171,910/172,000 = 99.9% 


Circular 1405 


Linear 1502 


73,000 


72,900/73,000 = 99.9% 










Circular 1603 




80 




Circular 1603 


Circular 1706 


410 


330/410 = 80% 


Linear 1603 




270 




Linear 1603 


Circular 1706 


7000 


6730/7000 = 96% 


Circular 1603 


Linear 1706 


10,800 


10,530/10,800 = 97% 



Analysis. In all configurations, i.e., circular or linear, the pEZC1405 x 
pEZC1502 pair (with att sites in inverted repeat configuration) was more efiBcient 
than pEZC1603 x pEZC1706 pair (with att sites mutated to avoid hairpin 
formation). The pEZC1603 x pEZC1706 pair gave higher backgrounds and 
lower efiBciencies than the pEZC1405 x pEZC1502 pair^ While less efficient, 
80% or more of the colonies firom the pEZC1603 x pEZC1706 reactions were 
expected to contain the desired plasmid product Making one partner linear 
stimulated the reactions in all cases. 

Part III: Cohjlrmationof Product Plasndds' Structure 

Six colonies each fix)m the linear pEZCI 405 (Figure 5G) x circular 
pEZC1502 (Figure 5H), circular pEZC1405 x linear pEZC1502, Imear 
pEZC1603 (Figure 6A) x circular pEZC1706 (Figure 6B), and circular 
pEZC1603 X linear pEZC1706 reactions were picked into rich medium and 
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miniprep DNAs were prepared. Diagnostic cuts with Ssp I gave the predicted 
restriction fragments for all 24 colonies. 

Anafysis. Recombination reactions between plasmids with mutant attL 
and attR sites on the same molecules gave the desired plasmid products with a 
high degree of specificity. 



Example 5: Recombinational Cloning with a Toxic Gene 
Pari I: Background 

Restriction enzyme Dpn I recognizes the sequence GATC and cuts that 
sequence only if the A is methylated by the dam methylase. Most commonly 
used K con strains are dam*. Expression of Dpn I in dam* strains ofK coli is 
lethal because the chromosome of the cell is chopped into many pieces. 
However, in dam' cells e?q)ression of Dpn I is iimocuous because the 
chromosorhe is immime to Dpn I cutting. 

In the general recombinational cloning scheme, in \^ch the vector donor 
contains two segments C and D separated by recombination sites, selection for 
the desired product depends upon selection for the presence of segment D, and 
the absence of segment C In the original Example segment D contained a drug 
resistance gene (Km) that was negatively controlled by a repressor gene found on 
segment C. When C was pr^ent, cells containing D were not resistant to 
kanamycin because the resistance gene was turned o£F. 

The Dpn I gene is an example of a toxic gene that can replace the 
repressor gene of the above embodiment If se^ent C expresses the Dpn I gene 
product, transforming plasmid CD into a dam^ host kills the cell. If-segmentD 
is transferred to a new plasmid, for exanq>le by recombinational clonings then 
selecting for the drug marker will be successful because the toxic gene is no 
longCTpresoit 
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Partll: Construction of a Vector Donor Using Dpn I as a Toxic Gene 

The gene encoding Dpn I endonuclease was amplified by PGR iising 
primers 5'CCA CCA CAA ACG CGT CCA TGG AAT TAC ACT TTA ATT 
TAG3' (SEQ. ID NO: 17) and 5*CCA CCA CAA GTC GAC GCA TGC CGA 
CAG CCT TCC AAA TGT3' (SEQ. ID NO: 1 8) and a plasmid containing the Dpn 
I gene (derived from plasmids obtained from Sanford A. Lacks, Brookhaven 
National Laboratory, Upton, New Yoric; also available from American Type 
Culture Collection as ATCC 67494) as the template. 

Additional mutations were introduced into the B and B* regions of attL 
and attR, respectively, by amplifying existing attL and attR domains with primers 
containing the desired base changes. Recombination of the mutant attL3 (made • 
with oligo Xisl 15) and attR3 (made with oligo Xisl 12) yielded attB3 with the 
following sequence (differences from attBl in bold): 

BOB* 
ACCCA GCTTTCTTGTACA3\A GtGGT (SEQ, ID NO: 8) 

tGggt cgaaaGaacatgttt CACCA 

The attL3 sequence was cloned in place of attL2 of an Existing Gene Donor 
plasmid to give the plasmid pEZC2901 (Figure 7A). The attR3 sequence was 
cloned in place of attR2 in an existing Vector Donor plasmid to give plasmid 
pEZC2913 (Figure 7B) I>pn I gene was cloned into plasmid pEZC2913 to replace 
the tet repressor gene. The resulting Vector Donor plasmid was named 
pEZC3101 (Rgure 7C). When pEZC3101 was transformed into the dam" strain 
SCSI 10 (Stratagene), hundreds of colonies resulted. When the same plasmid was 
transformed into the dam+ strain DH5a, only one colony was produced, even 
though the DH5a cells were about 20 fold more competent than the SCSI 10 
cells. When a related plasmid that did not contain the Dpn I gene was 
transformed into the same two cell lines, 28 colonies were produced from the 
SCSI 10 cells, vrfiile 448 colonies resulted from the DH5a cells. This is evidence 
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that the Dpn I gene is being expressed on plasmid pEZC3 101 (Figure 7C), and 
that it is killing the dam* DH5a cells but not the dam" SCSI 10 cells. 



Part in: Demonstration of Recombinational Cloning Using Dpn I Selection 

A pair of plasmids was used to demonstrate recombinational cloning with 
5 selection for product dependent iq>on the toxic gene Dpn I. Plasmid pEZC3 101 

(Figure 7C) was linearized with Mlu I and reacted with circular plasmid 
pEZC2901 (Figure 7A). A second pair of plasmids using selection based on 
control of dmg resistance by a repressor gene was used as a control: plasmid 
pEZC1802 (Figure 7D) was linearized wifli Xba I and reacted with circular 
10 plasmid pEZC1502 (Figure SH). Eight microliter reactions containing the same 

buflFer and proteins Xis, Int, and IHF as in previous examples were incubated for 
45 minutes at 25**C, then 10 minutes at 75**C, and 1 //Laliquots were transformed 
into DH5a O-c, dam+) competent cells, as presented in Table 6. 

Tables 



Reaction # 


Vector donor 


Basis of selection 


Gene donor 


Colonies 


1 


pEZC3101/Mlu 


Dpnitoxichy 




3 . 


2 


pEZC3101/Mtu 


Dpn I toxicity 


Circular p£ZC2901 


4000 


3 


pEZC1802/Xba 


Tet repressor 




0 


4 


pEZC18Q2/Xba 


Tetrepressor 


Circular p£ZC15Q2 


12100 



20 Miniprep DNAs were prepared from four colonies j&om reaction #2, and 

cut with restriction enzyme Ssp I. All gave the predicted fragments. 

Analysis: Subcloning using selection with -a toxic gene was 
dCTtionstrated. Plasmids of the predicted stracture were produced. : 
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Example6: Cloning of Genes with Uracil DNA Glycosylase and Subcloning 
of the Genes with Recombinational Cloning to Make Fusion Proteins 

Part I: Converting an Existing Ejqpression Vector to a Vector Donor for 
Recombinational Cloning 

5 A cassette useiiil for converting existing vectors into functional Vector 

Donors was made as follows. Plasmid pEZC3 101 (Figure 7C) was digested with 
Apa I and Kpn I, treated with T4 DNA polymerase and dNTPs to render the ends 
blunt, fiirther digested with Sma I, Hpa I, and AlwN I to render the undesirable 
DNA fiagments small, and the 2.6 kb cassette containing die attRl - Cm*^ - Dpn I 
10 - attR-3 domams was gel purified. The concentration of the purified cassette was 

estimated to be about 75 ng DNA/fil. 

Plasmid pGEX-2TK (Figure 8A) (Pharmacia) aUows fiisions betwe^ the 
protem glutathione S transferase and any second codmg sequence that can be 
inserted m its multiple cloning site. pGEX-2TK DNA was digested with Sma I 
15 and treated with alkalme phosphatase. About 75 ng of the above purified DNA 

cassette was Kgated with about 100 ng of the pGEX-2TK vector for 2.5 hours in 
a 5 j4 ligation, thrai 1 yl was transfomied into competoat BRL 3056 cells (a dam- 
derivative of DHIOB; dam- strams commercially available include DM1 ftom 
Life Technologies, Inc., and SCS 110 fiom Stratagene). Aliquots of the 
20 transframation mixture were plated on LB agar containing 1 00 pg/ml ampiciUm 

(resistance gene present on pGEX-2TK) and 30 ng/ml chloiamphenicol 
(resistance gene present on the DNA cassette). Colonies were picked and 
miniprep DNAs were made. The orientation of the cassette in pGEX-2TK was 
determined by diagnostic cuts witii EcoR L A plasmid with the desired 
25 orientation was named pEZC3501 ORgurc 8B). 
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Pari II: Cloning Reporter Genes Into an Recombinational Cloning Gene 
Donor Plasmid in Three Reading Frames 

Uracil DNA glycosylase (UDG) cloning is a method for cloning PGR 
amplijScadon products into cloning vectors (U.S. patent No. 5334,515, entirely 
5 incorporated herein by reference). Briefly, PGR amplification of the desired 

DNA segment is performed Avith primers that contain uracil bases in place of 
thymidine bases in flieir 5' ends. When such PGR products are incubated with the 
enzyme UDG, the uracil bases are specifically removed. The loss of these bases 
weakens base pairing in the aids of the PGR product DNA, and v^en incubated 

10 at a suitable temperature (e.g., 31^C\ the ends of such products are largely single 

stranded. If such incubations are done in the presence of linear cloning vectors 
containing protrading 3* tails that are complemmtary to the 3' ends of the PGR 
products, base pairing efBdently anneals the PGR products to &e cloning vector. 
When the annealed product is introduced into JSl coli cells by transformation, in 

15 vivo processes efficiently convert it into a recombinant plasmid. 

UDG cloning vectors that enable cloning of any PGR product in all three 
; reading frames were prepared fiom pEZG3201 (Figure 8K) as follows. Eight 
oligonudeotides were obtained from Life Technologies, Inc. (all written 5* - 3': 
rfl top (GGGG GAT TAG GAT ATG CGA AGG AGG GAA AAG GTG TAT 

20 TTT GAG GGT) (SEQ. ID NO:19), ril bottom (GAG GTT TTC GGT CGT 

. TGG GAT ATC GTA ATGXSEQ. ID NO:20), rf2 top (GGGCA GAT TAG 
GAT ATG GGA AGG AGG GAA AAC CTG TAT TTT GAG GGT)(SEQ. ID 
NO:21), rf2 bottom (GAG GTT TTG GGT GGT TGG GAT ATG GTA 
ATG TKSEQ. ID NOi22), rO top (GGGGAA GAT TAG GAT AtG GGA AGG 

25 AGG GAA AAG GTG TAT TTT GAG GGT)(SEQ. ID Nb:23), rO bottom 

(GAG GTT TTC GGT GGT TGG GAT ATC GTA ATG TT)(SEQ. ID NO:24). 
carboxy top (AGG GTT TAG GTG GAGXSEQ. ID NO:25) and carboxy 
bottom (TGGA GTG GAG GTA AAG GGT TGG GAG TTA TTAXSEQ. ID 
NO:26). The rfl, 2, and 3 top strands and the carboxy bottom strand 'were 

30 phosphoiylated on thdr 5' aids widi T4 polynucleotide kinase, and thm the 

conq>lementaiy strands of eadi pair woe hybridized. Plasmid pEZG3201 

BNSOOClDt <WO_G6<0ra4A1_l_> 
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(Figure 8K) was cut with Not I and Sal I, and aliquots of cut plasmid were mixed 
with the caiboxy-oKgo duplex (Sal I end) and either the rfl , rf2, or rf3 duplexes 
(Not I ends) (10 ng cut plasmid (about 5 pmol) mixed with 250 pmol carboxy 
oligo duplex, split into three 20 pi voliimes, added 5 jil (250 pmol) of rfl , rf2, or 
if3 daplex and 2 nl = 2 units T4 DNA ligase to each reaction). After 90 minutes 
of ligation at room temperature, each reaction was applied to a preparative 
agarose gel and tiie 2.1 kb vector bands were eluted and dissolved in 50 nl of TE. 

Partni: PCRofCATandphoA Genes 

Primers were obtained from Life Technologies, Inc., to amplify the 
chloramphenicol acetyl transferase (CAT) gene from plasmid pACYC184, and 
phoA, the alkaline phosphatase gene from R colL The primers had 12-base 5' 
extensions containing uracil bases, so that treatment of PGR products with uracil 
DNA glycosylase (UDG) would weaken base pairing at each end of the DNAs 
and allow the 3' strands to anneal with the protruding 3' ends of the rfl, 2, and 3 
vectors described above. The sequences of the primers (all written 5' - 3') 
were: CATIeft,UAUUUUCAGGGU ATGGAGAAA AAA ATC ACTGGA 
TAT ACC (SEQ. ID NO:27); CAT right, UCC CAC UUA UUA CGC CCC 
GCC CTG CCA CTC ATC (SEQ. ID NO:28); phoA left, uAu UUU CAG GGU 
ATG CCT GTT CTG GAA AAC CGG (SEQ. ID NO:29); and phoA right, UCC 
CAC UUA UUA TTT CAG CCC CAG GGC GGC TTT C (SEQ. ID NO:30). 
The primers were then used for PCR reactions using known method steps (see, 
e,g,, U.S. patetit No. 5,334,515, entirely incorporated herein by reference), and 
the polymerase chain reaction amplification products obtained with these primers 
Comprised the CAT or phoA genes with the initiating ATGs but without any 
transcriptional signals. In^ditipii, the uracilH:;ontaining sequences on the amino 
termini encoded tfie cleavage site for TEV protease (Life Technologies, Inc.), and 
those on the caxboxy terminal encoded consecutive TAA nonsense codonsl 

Unpurified PCR products (about 30 ng) were mixed with the gel purified, 
linear rfl, rQ, or rf3 cloning vectors (about 50 ng) in a 1 0 nl reaction containing 
IX REact 4 buffer (LIT) and 1 unit UDG (LTI). After 30 minutes at 37"C, 1 ill 
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aliquots of each icaction were transformed into competent E. coli DH5a cells 
(LIT) and plated on agar containing 50 ^g/ml kanamycin. Colonies were picked 
and analysis of miniprep DNA showed that the CAT gene had been cloned in 
reading ftame 1 (pEZC3601)(Figure 8C), reading frame 2 (pEZG3609)(Figure 
8D) and reading frame 3 (pEZC3617XFigure 8E), and that the phoA gene had 
been cloned in reading frame 1 (pEZC3606)(Figure 8F), reading frame 2 
(pEZC3613XFigure 8G) and reading frame 3 (pEZC3621)(Figure 8H). 

PartIV: SubcloningofCATorphoAfrom XJDG Cloning Vectors into a GST 
Fusion Vector 

Plasmids encoding fusions between GST and either CAT or phoA m all 
three reading frames were constructed by recombinational cloning as follows. 
Miniprep DNA of GST vector donor pEZC3501 (Figure 8B) (derived from 
Pharmacia plasmid pGEX-2TK as described above) was linearized with Cla I. 
About 5 ng of vector donor were mixed with about 10 ngeadi of the s^propriate 
circular gene donor vectors containing CAT or phoA in 8 ^1 reactions containing 
buffer and recombination proteins Int, Xis, and IHF (above). After incubation, 
1 \d of each reaction was transformed into E. coli strain DH5a and plated on 
ampicillin, as presented in Table 7. 



Table 7 



DNA 


.Colonies (10% of each transformation) 


Linear vector donor (pEZC350l/Cla) 


0 


Vector donor + CAT ifl 


no 


Vector donor + CAT rf2 


71 \ 


Vector doncM- + CAT rO 


148 


Vector dontw + lApA rfl 


121 


Vector d(»)or + jdioA 


128 


Vector donor + phoA rG 


. 31 
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Part V: Expression of Fusion Proteins 
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Two colonies fix)m each transfonnation were picked into 2 ml of rich 
medium (CircleGrow, BiolOl Inc.) in 17 x 100 mm plastic tubes (Falcon 2059, 
Becton Dickinson) containing 100 ^g/ml ampicillm and shaken vigorously for 
about 4 hours at 37**C, at which time the cultures were visibly turbid. One ml of 
each culture was transferred to a new tube containing 10 \i\ of 10% (w/v) IPTG 
to induce expression of GST. After 2 hours additional incubation, all cultures had 
about the same turbidity; the A600 of one culture was 1 .5. Cells bom 0.35 ml 
each culture were harvested and treated with sample buffer (containing SDS and 
p-mercaptoethanol) and aliquots equivalent to about' 0.1 5 A600 units of cells 
were applied to a Novex 4-20% gradient polyacrylamide gel. Following 
electrophoresis the gel was stained with Coomassie blue. 

Results: Enhanced expression of single protein bands was seen for all 12 
cultures. The observed sizes of these proteins correlated well with the sizes 
predicted for GST being fused (through attB recombination sites without stop 
codons) to CAT or phoA in three reading frames: CAT rfl = 269 amino acids; 
CAT r£2 = 303 amino acids; CAT rf3 = 478 amino acids; phoA rfl = 282 amino 
acids; phoA rf2 = 280 amino acids; and phoA rO == 705 amino acids. 

Analysis: Both CAT and phpA genes vvere subclon'&d into a GST fusion 
vector in all three reading frames, and e3q)ression of the six fusion proteins was 
demonstrated. 

While the foregoing invention has been described in some detail for 
purposes of clarity and undcrstandmg, it will be predated by one skilled in the 
art from a reading of this disclosure that various changes in form and detail can 
be made without departing from the true scope of the invention and appended 
claims. All patrats and publications cited herein are entirely incorporated herein 
by reference. 
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1. A Vector Donor DMA molecule comprising a first DNA segment 
and a second DNA segment, said first or second DNA segment containing at least 
one Selectable marker, wherein the first and second segments are separated cither 
by, (i) in a circular Vector Donor, a first and a second recombination site, or 
(ii) in a linear Vector Donor, at least a first recombination site, wherein each pair 
of flanking recombination sites are engineered and do not recombme with each 
other. 

2. A Vector Donor DNA molecule accordmg to claim I, wherein flie 
Selectable marker is at least one DNA s^ent selected fiom the group consisting 
of: 

(i) a DNA segment that encodes a product that provides 
resistance against otherwise toxic compounds; 

0i) a DNA segment that encodes a product that is otherwise 
lacking in the recipient cell; 

(iii) a DNA segment that encodes a i^oduct that suppresses the 
activity of a gene product; 

Ov) a DNA segmrat that encodes a product tfiat can be re^ 
idmtified; 

(v) aDNA segment that raicodes a product that is detrinental 
to cell survival and/or fimction; 

(vi) a DNA segment that inhibits the activity of any of the 
DNA segments of OHv) above ; 

(vii) a DNA segment that binds a product that modifies a 
substrate; 

(viii) a DNA segmrat that provides for isolation of a desired 
molecule; and 

Ox) a DNA segment that encodes a specific nucleotide 
sequence v^^ch can be otherwise non-functional; and 
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(x) a DNA segment that, when absent, directly or indirectly 
confers sensitivity to particular compounds. 

3. A Vector Donor DNA according to claim 2, \s*erein said 
Selectable marker is at least one selected from the group consisting of an 
antibiotic resistance gene, a tRNA gene, an auxotrophic maiker, a toxic gene, a 
phenotypic marker, an antisense oligonucleotide; a restriction endonuclease; a 
restriction endonuclease cleavage site, an en2yme cleavage site, a protein binding 
site; and a sequence complementary PGR primer. 

4. AVectorDonorDNAmoleculeaccordingtoclaiml,whereinsaid 
Selectable marker comprises at least one inactive fragment of a Selectable 
markCT, v^4ierein the inactive fragment is capable of reconstituting a functional 
Selectable niarker vs^en recombined across said first or second recombin^ 
with a further DNA segment comprising another inactive fragment of the 
Selectable marker. 

5. An Insert Donor DNA molecule, comprising a desired DNA 
segment flanked by a first recombination site and a second recombination site, 
wherein the first and second recombination sites are engineered and do not 
recombine with each other. 

6. An Insert Donor DNA molecule according to claiim 5, \rfierein 
said desured DNA segment codes for iat least one selected froin the group 
consisting of a cloning site, a restriction site, a promoter, an'6p»eron, an origin of 
replication^ a functional DNA, an antisense RNA, a PGR fi-agment,^ protein or 
a protein firagment 

7. A lot comprising a container being compartmentalized to receive 
in close confinement therein at one compartment, viierein a first compartment 
contains a Vector DorK>r DNA molecule comprising a first DNA segment and a 
second DNA segment, said first or second DNA segment containing at least one 
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Selectable marker, wherein the first and second segments are flanked either by, 
(i) in a circular Vector Donor, a first and a second recombination site, or (ii) in 
a linear Vector Donor, a first recombination site, v^erein each pair of flanking 
recombination sites are engineered and do not recombine with each other. 

8. A kit according to claim 7, fiirther comprising a second 
compartment containing an Insert Donor DNA molecule con^rising a desired 
DNA segment flanked by a first recombination site and a second recombination 
site, wherein the first and second recombination sites are engineered and do not 
recombine with each other. 

9. A kit according to claim 7, fiirther comprising an additional 
compartm^t containing at least one recombination protein capable of 
recombining a DNA segment comprising at least one of said recombination sites. 

10. A nucleic acid molecule, comprising at least one DNA segment 
having at least two recombination sites flanking a Selectable marker or a desired 
DNA segment, \^erein at least one of said recombination sites comprises a core 
region having at least one engineered mutation that enhances recombination in 
vftro in the fomiation of a Cointegratc DNA or a Produclf DNA. 

,11. A nucleic acid molecule according to claim 10, \^erein said 
mutatipn confers at least one ^ihancem^t of said recombination, said 
enhancement selected firom the group consisting of substantially (i) favoring 
excisive recombination; (jS) &voring integrative recombination; ^iO relieving the 
requirement for host fectors; Ov) increasing the efiflcienQr of said Coint^grate 
DNA or Product DNA fomiation;(v) increasing the specificity of said Cbintegrate 
DNA or Product DNA formation; and contributes desirable attributes to the 
Product DNA. 
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12. A nucleic acid molecule according to claim 10, wherein said 
recombination site is derived from at least one recombination site selected from 
the group consisting of attB, attP, attL and attR. 

13. A nucleic acid molecule according to claim 12, wherein said att 
site is selected from the groups consisting of attl, att2 and att3. 

14. A nucleic acid according to claim 10, wherein said core region 
comprises a DNA sequence selected from the group consisting of: 

(a) MCX'CWGCTTrYKTRTACNAASTSGB(m-att)(SEQIDNO:l); 

(b) AGCCWGCTTTYKTRTACNAACTSGB (in-attB) (SEQ ID 

NO:2); 

(c) GTTCAGCTTTCKTRTACNAACTSGB (m-attR) (SEQ ID 

NO:3); 

(d) AGCCWGCTTTCKTRTACNAAGTSGB (m-attL) (SEQ ID 

NO:4); 

(e) GTTCAGCTTTYKTRTACNAAGTSGB(m-attPl) (SEQ ID 

NO:5); 

and a corresponding or complementary DNA or RNA sequrace, vdierdn R=A or 
G; K=G or T/U; Y=C or T/U; W=A or T/U; N=A or C or G or t/U; S=Cor G; 
and B=C or G cm- T/U. 

15. A nucleic acid according to claim 14, wli^in said core region 
comprises a DNA sequence selected from the group consisting of: 

(a) AGCCTGCl"rii"nGTACAAACTTGT(attBl)(SEQIDNO:6); 

(b) AGCCTGCrTTCTTGTACAAACTrGT(attB2)(SEQIDNO:7); 

(c) AGCCAGCl l lUl rGTACAAACTTGT (attB3) (SEQ ID NO:8); 

(d) GTTGAGCTTTTTTGTACAAACrTGT(attRl)(SEQlbNO.-9); 

(e) GTTCAGCiriCllGTACAAACTTGT(attR2)(SEQIDNO:10); 

(f) GTTCAGCTTTCTTGTACAAAGTTGG (attR3) (SEQ ID 
NO:ll); 
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(g) AGCCTGCT nillGTACAAAGTTGG (attLl) (SEQ ID 
NO:12); 

(h) AGCCTGCTTTCTTGTACAAAGTTGG (attL2) (SEQ ID 
NO: 13); 

5 (i) ACCCAGCTTTCTTGTACAAAGTTGG (attL3) (SEQ ID 

NO:14); 

G) GTrCAGCTriTTTGTACAAAGTTGG(attPl) (SEQ ID NO:15); 
(k) GTTCAGCTTTCTTGTACAAAGTTGG (attP2^3) (SEQ ID 
NO:16); 

10 and a coixesponding or complementaiy DNA or RNA sequence. 

16. A method for making a nucleic acid molecule, comprising 
providing a nucleic acid molecule having at least one engineered 

recombination site comprising at least one DNA sequence haying at least 90% 

homology to at least one of SEQ ID NOS:l-16. 

15 17. A nucleic acid molecule provided by a method according to 
claim 16. 

18. A composition, comprising a nucleic acid TnolecuIe according to 
claim 10. 



20 



19. A kit, comprising a contains being compartmaitalized to receive 
in close confinement therein at least one comi>artment^ wherein a first 
compartment contains a composition according to claim 18'. 

20. A kit according to claim 19, further comprising a second 
compartmrnt having at least one recombination protein that recognizes said 
recombination site. 



BNSDCCID: <WO_9640724A1J_> 



10 



15 



WO 96/40724 PCT/US96/10082 

-57- 

21. A kit comprising a container being compartmentalized to receive 
in close confinement therein at least one recombination protein in isolated form, 
useful for a method according to claim 22. 

22. A method of making a Cointegrate DNA molecule, comprising 
combining m v//>-o.- 

(i) an Insert Donor DNA molecule, comprising a desired 
DNA segment flanked by a first recombination site and a 
second recombination site, wherein the firet and second 

recombination sites do not recombine with each other, 
(ii) a Vector Donor DNA molecule containing a third 

recombination site and a fourth recombination site, 

wherein the third and fourth recombination sites do not 

recombine with each othen and 
C«u) at least one site specific recombination protein capable of 

recombining said first and third recombinational sites said 

second and fourth recombinational sites; 

thereby aUowing recombination to occur, so as to produce a Cointegrate DNA 
molecule comprising said first and third or said second and fourth lecombmation 
Mtes. 0 



20 23. A method according to claim 22, 'wdierein a Product DNA 

molecule is produced fix)m said Comtegrate DNA by recombming at leas^ 
0) said first and third, or fri) said second and fourth, recombination sites, said 
Product DNA comprisiiig said desired DNA segment ' ^ 

24. A method according to claim 23, vi*erein said method also 
25 produces a Byproduct DNA molecule. 

25. A method according to claim 23. fimher comprising selecting for 
the Product DNA molecule. 
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26. A method as claimed in claim 22, wherein the Vector Donor DNA 
molecule comprises a vector segmrait flanked by said third and the fourth 
recombination sites. 

27. A method as claimed in claim 22, wherein the Vector Donor DNA 
molecule finther comprises (a) a toxic gene and (b) a Selectable marker, wherein 
the toxic gene and the Selectable maricer are on different DNA segments, the 
DNA segments being separated either by (i) in a circular DNA molecule, two 
recombination sites, or (ii) in a linear DNA molecule, one recombination site. 

28. A method as claimed in claim 22, wherein the Vector Donor DNA 
molecule further comprises (a) a repression cassette and (b) a Selectable marker, 
repressed by the repressor of the repressi<Hi cassette, and wherein the Selectable 
marker and the repression cassette are on different DNA segments, the DNA 
segments bdng separated either by, (i) in a circular DNA molecule, two 
recombination sites, or ^i) in a linear DNA molecule, one recombination site. 

29. AmedK)dasclaimedindaim22,v^ereinatleastoneoflhehisert 
Donor DNA molecule and the Vector Donor DNA molecule is a circular DNA 
molecule. * 

30. A method as claimed in claim 22, v»*erein at least one of the Ins«t 
Do^or DNA molecule and the Vector Donor DNA molecule is a linear DNA 
molecule. 

3 1. A method as claimed in claim 22, vt^erein the selecting stq> is 
carried; out ui v//>-a of in vfve>. 

32. A method as claimed in claim 22, wherein said recombination 
protein comprises at least a first recombination protein and a second 
recombination protein, said second recombination protean befaig diflfeiait fiom 
said first recombination jxrotein. 
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33. A method as claimed in claim 22, wherein said recombination 
protein is Int. 



34. A method as claimed in claim 22, wherein the at least 
recombination protein is selected fiom (i) Int and IHF and 00 Int, Xis. and IHF. 



one 
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Recombinational Cloning Using 
Engineered Recombination Sites 



Background of the Invention 
Field of the Invention 

Th^ present invention relates to recombinant DNA technology. DNA and 
vectors having engineered recombination sites are provided for use in a 
recombinational cloning method that aiables efficient and specific recombination 
of DNA segments using recombmation protems. The DNAs, vectors and 
methods are useftil for a variety of DNA exchanges, such as subcloning of DNA, 
15 in vitro or in vivo. 

RelatedArt 



Site specific recomhinases. Site specific recombinases are enzymes that 
are present in some viruses and bacteria and have been characterized to have both 
endonuclease and ligase properties. These recombinases (along with associated 
proteins in some cases) recognize specific sequences of bases in DNA and 
exchange the DNA segments flankmg those segments. The lecombmases and 
associated proteins are coHectively referred to as "recombination proteins" {see, 
e.g.,, Landy, A., Current Opinion in Biotechnology 3:699-101 (1993)). 

Numerous recombination systems from various organisms have been 
desoribed. See, e.g, Hoess et al. Nucleic Acids Research 14{6y22Zl (1986); 
Abremski et al, J, Biol Chem,26 J {1)1391 (1986); Campbell, J. 
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Bacteriol 77^(23):7495 (1992); Qian etal.J. Biol Ghent 267(1 1):7794 (1992); 
Araki et aL,J. Mol Biol 225(1):25 (1992); Maeser and Kahnmann (1991) MoL 
Gen Genet 250:170-176). 

Many of these belong to the integrase family of recombinases 
5 {AxgoseiaL EMBOJ. 5:433-440(1986)). Perhaps the best studied of these are 

the Integrase/a/f system from bacteriophage X (Landy, A. Current Opinions in 
Genetics cmd Devel 5:699-707 (1993)), the CrdloxP system from bacteriophage 
PI (Hoess and Abremski (1990) In Nucleic Acids and Molecular Biology^ vol. 4. 
Eds.: Eckstein and Lilley, Berlin-Heidelberg: Springer-Verlag; pp. 90-109) , and 
10 the FLP/FRT system from the Saccharomyces cerevisiae 2 ^ circle plasmid 

(Broach et al Cell 29an-2ZA (1982)). 

While these recombination systems have been characterized for particular 
organisms, the related art has only taught using recombinant DNA flaxiked by 
recombination sites, for in vivo recombination. 
15 Backman (U.S. Patent No. 4,673,640) discloses the in vivo use of A 

recombinase to recombine a protein producing DNA segment by enzymatic site- 
specific recombination using wild-type recombination sites attB and attP. 

Hasan and iSzybalski (Gene 55:145-151 (1987)) discloses the use of A Int 
recombinase in vivo for intramolecular recombination between wild type attP 
20 and o^^^tes which flank a promoter. Because the orientations of these sites are 

mverted relative to each other, this caxises an irreversible flipping of the promoter 
region relative to the gene of intCTest 

Palazzolo et al Gene 88:25-36 (1990), discloses phage lambda vectors 
having bacteriophage X arms that contain restriction sites positioned outside a 
25 cloned DNA sequence and between wild-type loxF sites. Infection ofK coli cells 

that expr^ the Cre rcbombmase with these phage vectors results in 
recombination between tte loxP sites and the in viva excision of tfie plasmid 
replicon, includmg the cloned cDNA. 

Posfei et al. (Nucl Acids Res, 22:2392-2398 (1994)) discloses a method 
30 for inserting into genomic DNA partial expression vectors having a selectable 

inarkCT, flanked by two wild-^FRT recognition sequel FLP site-specific 
recombinase as present in the cells is used to integrate Ae vectors into the 
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genome at predetermined sites. Under conditions where the replicon is 
functional, this cloned genomic DNA can be amplified. 

Bebee et al (U.S. Patent No. 5,434,066) discloses the use of site-specific 
recombinases such as Cre for DNA containing two loxP sites is used for in vivo 
5 recombination between the sites. 

Boyd {Nucl Acids Res, 2J:il7-i2l (1993)) discloses a method to 
facilitate the cloning of blunt-ended DNA using conditions that encourage 
intennolecular ligation to a dephosphorylated vector that contains a wild-type 
loxP site acted upon by a Cre site-specific recombinase present in £ coli host 
10 cells. 

Waterhouse etaL (PCTNo. 93/19172 Nucleic Acids Res, 21 (9)2265 
(1993)) disclose an in vivo method where light and heavy chains of a particular 
antibody were cloned in different phage vectors between loxP and loxP 511 sites 
and used to transfect new E, coli cells. Cre, acting in the host cells on the two 
15 parental molecules (one plasmid, one phage), produced four products in 

equilibrium: two different cointegrates (produced by recombination at either loxP 
or loxP 51 1 sites), and two daughter molecules, one of which was the desired 
product 

In contrast to the other related art, Schlake & Bode {Biochemistry 
20 53:12746-12751 (1994)) discloses an in vivo method to e^hange expression 

cassettes at defined chromosomal locations, each flanked by a wild type and a 
spacer-mutated FRT recombination site. A double-reciprocal crossover was 
mediated m cultured mammalian cells by using this FLP/FRT system for site- 
specific recombination. 

25 Transposases. The family of enzymes, the transposases, has also been 

used to transfer genetic information between replicons. Transposons are 
structurally variable, being described as simple or compound, but typically 
encode the recombinase gene flanked by DNA sequences organized in inverted 
orientations. Integration of transposons can be random or highly specific. 

30 Representatives such as Tn7, which are highly site-specific, have been applied to 

the w vivo movement of DNA segments between replicons (Lucklow et al.^ 
J. ViroL 57:4566-4579(1993)). 
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Devine and Boeke Nucl Acids Res, 22:3765-3772 (1994), discloses the 
construction of artificial transposons for the insertion of DNA segments, in vitro, 
into recipient DNA molecules. The system makes use of the integrase of yeast 
TYl virus-like particles. The DNA segment of interest is cloned, using standard 
5 methods, between the ends of the transposon-like element TYl . In the presence 

of the TYl integrase, the resulting element integrates randomly into a second 
target DNA molecule. 

DNA cloning. The cloning of DNA segments currmtly occurs as a daily 
routine in many research labs and as a prerequisite irtep in many genetic analyses. 
10 The purpose of these clonings is various, however, two general purposes can be 

considered: (1) the initial cloning of DNA from large DNA or RNA segments 
(chromosomes, YACs, PGR fragments, mRNA, etc.), done in a relative handful 
of known vectors such as pUC, pGem, pBlueScript, and (2) the subcloning of 
these DNA segments into specialized vectors for functional analysis. A great 
15 deal of time and effort is expended both in the initial cloning of DNA segments 

and in the transfer of DNA segments fix>m the initial cloning vectors to the more 
specialized vectors. This transfer is called subcloning. 

The basic methods for cloning have been known for many years and have 
changed little during that time. A typical cloning protocol is as follows: 
20 (1) digest flie DNA of interest with oi;^ or two restriction 

enzymes; 

(2) gel purify the DNA segment of int«:est ^en known; 

(3) prepare the vector by cutting with appropriate restriction 
enzymes, treating with alkaline phosphatase, gel purify etc., as 

25 appropriate; 

(4) ligate the DNA segment to vector, with ai^>ropriate controls 

to estimate backgn>und of uncut and self-ligated vectbn 

(5) introduce the resulting vector into an K co/i host ceU; 

(6) pick selected colonies and ^w small cultures overnight; 
30 (7) make DNA mini{nreps; and 

(8) analyze the isolated plasimd on agarose gels (bfte»i after 
diagnostic restriction enzyme digestions) or by PGR. 
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The specialized vectors used for subcloning DNA segments are 
functionally diverse. These include but are not limited to: vectors for expressing 
genes in various organisms; for regulating gene expression; for providing tags to 
aid in protein purification or to allow tracking of proteins in cells; for modifying 
the cloned DNA segment (e,g., generating deletions); for the synthesis of probes 
(e.g,y riboprobes); for the preparation of templates for DNA sequencing; for the 
identification of protein coding regions; for the fusion of various protein-coding 
regions; to provide large amounts of the DNA of interest, etc. It is common that 
a particular investigation will involve subcloning the DNA segment of interest 
into several different specialized vectors. 

As known in the art, simple subclonings can be done intone day (e,^., the 
DNA segment is not large and the restriction sites are compatible with those of 
the subcloning vector). However, many other subclonmgs can take several 
weeks, especially those involving unknown sequences, long fragments, toxic 
genes, unsuitable placement of restriction sites, high backgrounds, impure 
enzymes, etc. Subclonmg DNA fi:agments is thus often viewed as a chore to be 
done as few times as possible. 

Several methods for facilitating the cloning of DNA segments have been 
described, e,g., as in the following references. 

Ferguson, J., eiaL Gene 16:191 (1981), discloses a J^ly of vectors for 
subcloning fragments of yeast DNA. The vectors encode kanamycin resistance. 
Clones of longer yeast DNA segmrats can be partially digested and ligated into 
the subclonmg vectors. If the original cloning vector conveys resistance to 
ampicillin, no purification is necessary prior to transformation, since the selection 
will be for kanamycin. 

HasMmoto-Gotoh, T., ei aL Gene 41:125 (1986), discloses a subcloning 
vector with unique cloning sites within a streptomycin saisitivity gene; in a 
streptomycin-resistant host, only plasnaids with inserts or deletions m the 
dominant sensitivity gene will survive streptomycin selection. 

Accordingly, traditional subcloning methods, using restriction enzymes 
and ligase, are time consuming and relatively unreliable. Considerable labor is 
expended, and if two or more days later the desired subclone can not be found 
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among the candidate plasmids, the entire process must then be repeated with 
alternative conditions attempted. Ahhough site specific recombinases have been 
used to recombine DNA in vivo, the successful use of such enzymes in vitro was 
expected to suffer from several problems. For example, the site specificities and 
5 efficiencies were expected to differ in vitro; topologically-linked products were 

expected; and the topology of the DNA substrates and recombination proteins 
was expected to difiFer significantly in vitro {see, e.g.^ Adams et al J. MoL 
BioL 225:661-73 (1992)). Reactions that could go on for many hours in vivo 
WCTe expected to occur in significantly less time in vitro before the enzymes 

10 became inactive. Multiple DNA recombination products were expected in the 

biological host used, resulting in unsatisfactory reliability, specificity or 
eflSciency of subcloning. In vitro recombination reactions were not expected to 
be suflBciently efficient to yield the desired levels of product. 

Accordingly, there is a long felt need to provide an alternative subcloning 

15 system that provides advantages over the known use of restriction enzymes and 

ligases. 

Summary of the Invention 

The present invention provides nucleic add, vectors and methods for 
obtaining chirheric nucleic acid using recombination proteins and engineered 

20 recombination dtes, 7/1 vi/ro or m vfvo. These methods are highly specific, rapid, 

and less labor intensive than what is disclosed or suggested in the related 
baickground art The improved specificity, speed and yields of the present 
invention &cilitates DNA or RNA subdoning, regulation or exchange useful for 
any related puEpbse. Such purposes include in vitro recombination of DNA 

25 segments and in vitro or in vivo insertion or modification of transcribed, 

rq>Iicated, isolated or genomic DNA or RNA. 

The present invention relates to nucldc adds, vectors and methods for 
moving of exchanging s^ments of DNA using at least one engineered 
recombination site and at least one recombination protein to provide diimeric 

30 DNA molecules which have the desired characteristic(s) and/or DNA segmmt(s). 
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Generally, one or more parent DNA molecules are recombincd to give one or 
more daughter molecules, at least one of which is the desired Product DNA 
segment or vector. The invention thus relates to DNA, RNA, vectors and 
methods to effect the exchange and/or to select for one or more desired products. 

One embodiment of the present invention relates to a method of making 
chimeric DNA, which comprises 

(a) combining in vitro or in vivo 

(i) an Insert Donor DNA molecule, comprising a desired DNA 
segment flanked by a first recombination site and a second recombination site, 
wherein the first and second recombination sites do not recombine with each 
other; 

(ii) a Vector Donor DNA molecule containing a third 
recombination site and a fourth recombination site, wherein the third and fourth 
recombination sites do not recombine with each other; and 

Oii) one or more site specific recombination proteins capable 
of recombining the first and third recombinational sites and/or the second and 
fourth recombinational sites; 

thereby allowing recombination to occur, so as to produce at least one 
Cointegrate DNA molecule, at least one desired Product DNA molecule which 
20 comprises said deshred DNA segment, and optionally^a Byproduct DNA 

molecule; and then, optionally, 

(b) selecting for the Product or Byproduct DNA molecule. 
Another embodiment of the present invention relates to a kit comprising 
a carrier or receptacle being compartmentalized to receive and hold therein at 
25 least one contamer, wiierein a first container contains a DNA molecule 

comprising a vector having at least two recombination sites flanking a cloning 
site or a Selectable marker, as described herein. The kit optionaUy fiirther 
comprises: 

(i) a second container contaming a Vector Donor plasmid 
30 comprising a subcloning vector and/or a Selectable marker of vt^ch one 

or both are flanked by one or more mgineered recombination sites; and/or 
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(ii) a third container containing at least one recombination protein 
which recognizes and is capable of recombining at least one of said 
recombination sites. 

Other embodiments include DNA and vectors useful in the methods of the 
present invention. In particular. Vector Donor molecules are provided in one 
embodiment, wAerein DNA segments within the Vector Donor are separated 
either by, (i) in a circular Vector Donor, at least two recombination sites, or (ii) 
in a hnear Vector Donor, at least one recombination site, where the recombination 
sites are preferably engineered to enhance specificity or efficiency of 
recombinatioiL 

One Vector Donor embodiment comprises a first DNA segmmt and a 
second DNA segment, the first or second segment comprising a Selectable 
maricer. A second Vector Donor embodiment comprises a first DNA segment 
and a second DNA segment, the first or second DNA segment comprising a toxic 
gene. A third Vector Donor embodiment comprises a first DNA segment and a 
second DNA segment, the first or second DNA segment comprising an inactive 
firagm^t of at least one Selectable marker, wherein the inactive fiagment of the 
Selectable maricer is capable of reconstituting a fimctional Selectable marker 
A^en recombined across the first or second recombination site with another 
inactive fiagment of at least one Selectable marker. 

The present recombinational cloning method possesses several advantages 
ovCTpreviouswvfvomelhods. Since smgle molecules of recombination products 
can be introduced into a biological host, propagation of the desired Product DNA 
in the absence of other DNA molecules {e.g,, starting molecules, intermediates, 
and by-products) is more readily realized. Reaction conditions can be fteely 
adjusted /n vitro to optimize m^roe activities. DNA molecules can be 
incompatible with the desired biological host (e:g,, YACs, genomic DNA, etc), 

can be used. Recombination proteins from diverse sources can be en^ldyed, 
together or sequCTctially. 

04er embodiments will be evident to those of ordinary skill in tiie art 
finom the teadiings contained hoem in combination with -what is known to tiie art 
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Brief Description of the Figures 

Figure 1 depicts one general method of the present invention, wherein the 
starting (parent) DNA molecules can be circular or linear. The goal is to 
exchange the new subcloning vector D for the original cloning vector B. It is 
desirable in one embodiment to select for AD and against all the other molecules, 
including the Cointegrate. The square and circle are sites of recombination: e.g. , 
loxP sites, att sites, etc. For example, segment D can contain expression signals, 
new drug markers, new origins of replication, or specialized functions for 
mapping or sequencing DNA. 

Figure 2A depicts an in vitro method of recombining an Insert Donor 
plasmid (here, pEZC705) with a Vector Donor plasmid (here, pEZC726), and 
obtaming Product DNA and Byproduct daughter molecules. The two 
recombmation sites are attP and loxP on the Vector Donor. On one segment 
defined by these sites is a kanamycin resistance gene whose promoter has been 
replaced by the tetOP operator/promoter from transposon 7>il0: See Sizemore 
et a/., Nuch Acids Res, 7*(10):2875 (1990). In the absence of tet repressor 
protein, R coli RNA polymerase transcribes the kanamycin resistance gene from 
the ietOP. If tet repressor is present, it binds to tetOP and blocks transcription of 
the kanamycm resistance gene. The other segment of pKZC726 has the tet 
repressor gene expressed by a constitutive promoter. Thus cells transformed by 
pEZC726 are resistant to chloramphenicol, because of die chlorampheiucol acetyl 
transferase gene on the same segment as tetR, but are sensitive to kanamycin. 
The recombinase-mediated reactions result in separation of the tetR gene from the 
regulated kanamycin resistance gaie. This separation results in kanamycin 
resistance in cells receiving only the desired recombination products. The first 
recombination reaction is driven by the addition of the recombihase called 
Integrase. The second recombmation reaction is driven by adding the 
recombinase Cre to the Cointegrate (here, pEZC7 Cointegrate). 
Figure 2B depicts a restriction map of pEZC705. 
Figure 2C depicts a restriction map of pEZC726. 
Figure 2D depicts a restriction map of pEZC7 Cointegrate. 
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Figure 2E depicts a restriction map of Intprod. 
Figure 2F depicts a restriction map of Intbypro. 

Figure 3A depicts an in vitro method of recombining an Insert Donor 
plasmid (here, pEZC602) with a Vector Donor plasmid (here, pEZC629), and 
obtainbg Product (here, EZC6prod) and Byproduct (here, EZC6Bypr) daughter 
molecules. The two recombination sites are /oxP and /axP 5/7. One segment of 
pEZC629 defined by these sites is a kanamycin resistance gene whose promoter 
has been replaced by the tetOP operator^romoter from transposon TnlO. In the 
absence of tet repressor protein, K coli RNA polymerase transcribes the 
kanamycin resistance gene from the tetOP. If tet repressor is present, it binds to 
tetOP and blocks transcription of the kanamycin resistance gene. The other 
segment of pEZC629 has the tet repressor gene expressed by a constitutive 
promoter. Thus cells transformed by pEZC629 are resistant to chloramphenicol, 
because of the chloramphenicol a^^etyl transferase gene on the same segment as 
tetR^ but are sensitive to kanamycin. The reactions result in separation of the tetR 
gene from the regulated kanamycin resistance gene. This separation results in 
kanamycin resistance in cells receiving the desired recombination product The 
first and the second recombination events are driven by the addition of the same 
recombinase, Cre. 

Figure 3B depicts a restriction map of EZC6Bypr^ 

Figure 3C depicts a restriction m^ of EZC6prod. 

Figure 3D dq)icts a restriction map of pEZC602. 

Figure 3E depicts a restriction map of pEZC629. 

Figure 3F depicts a restriction map of EZC6coinL 

Figure 4A depicts an application of the m vitro method of 
recombinational cloning to subclcme tiie chloramph^col acetyl tran5f«:ase grae 
into a vector for expression in eukaiyotic cells. The Insol Donor plasmid, 
pEZC843, is comprised of the chloramphenicol acetyl transferase gene ofK coli, 
cloned between /oxP and orr^ sites such that the /orP site is po 
5'-end of the gene. The Vector Donor plasmid, pEZC1003, contains the 
cytomegalovirus eukaryotic promoter ^jposed to a loxP site. The si9>afcoiled 
plasmids were combined with lambda Integrase and Cre recombinase in vitro. 
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After incubation, competent E. coli cells were transformed with the 
recombinational reaction solution. Aliquots of transformations were spread on 
agar plates containing kanamycin to select for the Product molecule (here 
CMVProd). 

5 Figure 4B depicts a restriction map of pEZC843. 

Figure 4C depicts a restriction map of pEZC 1 003. 
Figure 4D depicts a restriction map of CMVBypro. 
Figure 4E depicts a restriction map of CMVProd. 
Figure 4F depicts a restriction map of CMVcoint 
10 Figure 5A depicts a vector diagram of pEZCl 301 . 

Figure 5B depicts a vector diagram of pEZCl 305. 
Figure 5C depicts a vector diagram of pEZC1309. 
Figure 5D depicts a vector diagram of pEZC1313. 
Figure 5E depicts a vector diagram of pEZC1317. 
15 Figure 5F depicts a vector diagram of pEZC1321. 

Figure 5G depicts a vector diagram of pEZCl 405, 
Figure 5H depicts a vector diagram of pEZC1502. 
Figure 6A depicts a vector diagram of pEZC 1 603. 
figure 6B depicts a vector diagram of pEZCl 706. 
^0 Figure 7A depicts a vector diagram of pEZC2901.^ 

Figure 7B depicts a vector diagram of pEZC29 1 3 
Figure 7C depicts a vector diagram of pEZC3 101. 
Figure 7D depicts a vector diagram of pEZCl 802. 
Figure 8A depicts a vector diagram of pGEX-2TK. 
25 Figure 8B depicts a vector diagram of pEZC3501. 

Figure 8C depicts a vector diagram of pEZC3601. 
Figure 8D depicts a vector diagram of pEZC3609. 
Figure 8E depicts a vector diagram of pEZC3617. 
Figure 8F depicts a vector diagram of pEZC3606. 
^® Figure 8G depicts a vector diagram of pEZC3613. 

Figure 8H depicts a vector diagram of pEZC362L 
Figure 81 depicts a vector diagram of GST-CAT. 
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Figure 8J depicts a vector diagram of GST-phoA. 
Figure 8K depicts a vector diagram of pEZC3201 . 

Detailed Description of the Preferred Embodiments 

It is unexpectedly discovered in the present invention that subcloning 
5 reactions can be provided using recombinational cloning. Recombination cloning 

according to the present invention uses DNAs, vectors and methods, in vitro and 
in vivo, for moving or exchanging segments of DNA molecules using engineered 
recombination sites and recombination proteins. These methods provide chimoic 
DNA molecules that have the desired characteristic(s) and/or DNA segment(s). 

10 The present invention thus provides nucleic acid, vectors and methods for 

obtaining chimeric nucleic acid using recombination proteins and engineered 
recombination sites, in vitro or in vivo. These methods are highly specific, rapid, 
and less labor intensive than \\iiat is disclosed or suggested in the related 
backgrotmd art. The improved specificity, speed and yields of the present 

15 invention &cilitates DNA or RNA subcloning, regulation or exchange usefiil for 

any related purpose. Such purposes include in vitro recombination of DNA 
segments and in vitro or in vivo insertion or modification of transcribed, 
replicated, isolated or genomic DNA or RNA. 0 

Definitions 

20 In the description that follows, a nimiber of terms used in recombinant 

DNA technology are utilized extensively. In ord^ to provide a clear and 
consistent understandinjg; of the specification and claims, including the scope to 
be given such terms, the following definitions are provided. 

Byproduct: is a daughter molecule (a new clone produced after the 

25 second recombination event during the recombinational cloning process) lacking 

the DNA \^ch is desired to be subcloned. 
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Cointegrate: is at least one recombination intermediate DNA molecule 
of the present invention that contains both parental (starting) DNA molecules. 
It will usually be circular. In some embodiments it can be linear. 

Host: is any prokaryotic or eukaryotic organism that can be a recipient 
of the recombinational cloning Product. A "host," as the term is used herein, 
includes prokaryotic or eukaiyotic organisms that can be genetically engineered. 
For examples of such hosts, see Maniatis et aL, Molecular Cloning: A Laboratory 
Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, New Yoric (1982). 

Insert: is the desked DNA segment (segmental of Figure 1) vMch one 

wishes to manipulate by the method ofthe present invention. The insert can have 
one or more genes. 

Insert Donor: is one of flie two parental DNA molecules ofthe present 
invention vAnch carries the Insert The Insert Donor DNA molecule comprises 
the Insert flanked on both sides with recombination signals. The Insert Donor 
can be linear or ckcular. In one embodunent ofthe invention, the Insert Donor 
is a circular DNA molecule and fiirther comprises a cloning vector sequence 
outside ofthe recombination signals (see Figure 1). 

Product: is one or both the desired daughter molecules comprising the A 
and 2> or J? and C sequences v^iiich are produced after the second recombination 
event during the recombinational cloning process (see Figwe 1). Hie Product 
contains the DNA which was to be cloned or subcloned. 

Promoter: is a DNA sequrace generally described as the 5'-region of a 
gene, located proximal to the start codon. The transcription of an adjacent DNA 
segment is initiated at the promoter region. A repressible promoter's rate of 
transcription decreases in response to a r^jressing agent An inducible promoter's 
rate of transcription increases in response to an inducing agent A ^constitutive 
promoter's rate of transcription is not specifically regulated, though it can vary 
under the influence of general metabolic conditions. 
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Recognition sequence: Recognition sequences are particular DNA 
sequences which a protein, DNA, or RNA molecule (e,g,, restriction 
endonuclease, a modification methylase, or a recombinase) recognizes and binds. 
For example, the recognition sequence for Cre recombinase is /oxP which is a 34 
5 base pair sequence comprised of two 13 base pair inverted repeats (serving as the 

recombinase binding sites) flanking an 8 base pair core sequence. See Figure 1 
of Sauer, B., Ctarent Opinion in Biotechnology 5:521-527 (1994). Other 
examples of recognition sequences are the attB, attP, dttL, and attR sequences 
which are recognized by the recombinase enzyme X Integrase. attB is an 
10 approximately 25 base pair sequence containing two 9 base pair core-type Int 

binding sites and a 7 base pair overlap region. attP is an a^yproximately 240 base 
pair sequence containing core-type Int binding sites and arm-type Int binding 
sites as well as sites for auxiliary proteins IHF, FIS, and Xis. See Landy, Current 
Opinion in Biotechnology 3:699-707 (1993). Such sites arc also engineered 
15 according to the present invention to enhance methods and products. 

Recombinase: is an enzyme vMch catalyses the exchange of DNA 
segments at specific recombination sites. 

Recombinational Cloning: is a method described herein, whereby 
segments of DNA molecules are exchanged, inserted, replaced, substituted or 
20 modified, m v/fro or i« vrvo. ' 

Recombination proteins: include excisive or integrative proteins, 
enzymes, co-factors or associated proteins that are involved in recombination 
reactions involving one or morc recombination sites. See, Landy (1 994), injra. 
Repression cassette: is a DNA segment that contains a repressor of a 
25 Selectable marker present in the subcloning vector. 

Selectable marker: is a DNA segment that allows one to select for or 
against a molecule or a cell that contams it, often under particular conditions. 
These markers can encode an activity, such as, but not limited to, prodtiction of 
RNA, peptide, or protein, or can provide a binding site for RNA, peptides, 
proteins, inorganic and organic compounds or compositions and the like. 
Examples of Selectable markers include but are not limited to: (1) DNA 
segments that encode products \^ch provide resistance against otherwise toxic 
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compoimds {e.g., antibiotics); (2) DNA segments that encode products vAuch are 
otherwise lacking in the recipient cell (e.g., tRNA genes, auxotrophic markers); 
(3) DNA segments that encode products which suppress the activity of a gene 
product; (4) DNA segments that encode products v^ch can be readily identified 
5 {e.g., phenolic markers such as P-galactosidase, green fluorescent protein 

(GFPX and cell suifece i»oteins); (5) DNA segments that bind products which are 
otherwise detrimental to cell survival and/or function; (6) DNA segments that 
otherwise inhibit the activity of any of the DNA segments described in Nos. 1 -5 
above {e.g., antisense oKgonucleotides); (7) DNA segments that bind products 
10 tl»at modify a substrate restriction endonucleases); (8) DNA segments that 

can be used to isolate a desired molecule {e.g. specific protein binding sites); (9) 
DNA segments that encode a qjecific nucleotide sequence which can be 
otherwise non-functional {e.g., for PGR amplification of subpopulations of 
molecules); and/or (10) DNA segments, which when absent, directly or indirectly 
15 confer sensitivity to particular compounds. 

Selection scheme: is any method which allows selection, enrichment, or 
identification of a desired Product or Product(s) fiom a mixture containing the 
Insert Donor, Vector Donor, and/or any intermediates, {e.g. a Cointegrate) 
Byproducts. The selection schemes of one prefenred embodiment have at least 
20 two components that are either linked or unlinked dur^g recombinational 

cloinng. One component is a Selectable marker. The other component controls 
the expression in vitro oi in vivo of the Selectable marker, or survival of the cell 
harboring the plasmidcanying the Selectable marker. Generally, this controlling 
element will be a repressor or inducer of the Selectable marker, but other means 
25 for controlling expression ofthe Selectable marker can be used. Whether a 

repressor or activator is used wiU depend on whether the maricer is for a positive 
or negative selection, and the exact arrangement of the various DNA segments, 
as WiU be readfly apparent'to those skiUed in the art A preferred requirement is 
that the selection scheme results in selection of or enrichment for only one or 
more desired Products. As defined herein, to select for a DNA molecule includes 
(a) selecting or enriching for the presence of the desired DNA molecule, and (b) 
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selecting or enriching against the presence of DNA molecules that are not the 
desired DNA molecule. 

In one embodiment, the selection schemes (^^ch can be carried out 
reversed) will take one of three forms, ^^ch will be discu^ed in terms of 
Figure 1 . The first, exCTiplified herein with a Selectable marker and a repressor 
therefor, selects for molecules having segment D and lacking segment C. The 
second selects against molecules having segment C and for molecules having 
segment 2>. Possible embodiments of the second form would have a DNA 
segment carrying a gene toxic to cells into which the in vitro reaction products 
are to be introduced. A toxic gene can be a DNA that is esqnressed as a toxic gene 
product (a toxic protein or RNA), or can be toxic in and of itself. (In the latter 
case, the toxic gene is understood to cany its classical definition of "heritable 
trait".) 

Examples of such toxic gene products are well known in the art, and 
include, but are not limited to, restriction endonucleases (c-g:, Cpnl) and genes 
that kill hosts in the absence of a siq)pressing function, eg., kicB, A toxic gene 
can alternatively be selectable in vitro ^ e.g, , a restriction site. 

In the second form, segment D carries a Selectable marker. The toxic 
gene would elinoinate transformants haibcHing the Vector Donor, Cointegmte, and 
Byproduct molecules, while the Selectable maricer can be to select for cells 
containing die Product and against celLs harboring only the Insert Donor. 

The third form selects for cells that have bodisegments>4 andDiacison 
the same molec^e, but not for cells that have bodi segments in trcms on different 
molecules. This could embodied by a Selectable marker that is split into two 
inactive fiagmoits, one each on ^gments ^ and />. 

The fiagments are so arranged relative to the recombination sites that 
^en the segments are brought together by the recombination event, they 
reconstitute a fimctional Selectable marker. For example, tfie recombinatior^ 
event can link a promote with a structural gene, can link two firagments of a 
structural gene, or can link genes tiiat «c6de a heterodimeric gene product 
needed for survival, or can link portions of a replicon. 
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Site-specific recombinase: is a type of recombinase which typically has 
at least the following four activities: (1) recognition of one or two specific DMA 
sequences; (2) cleavage of said DNA sequence or sequences; (3) DNA 
topoisomerase activity involved in strand exchange; and (4) DNA ligase activity 
5 to reseal the cleaved strands of DNA. See Sauer, B., Current Opinions in 

Biotechnology 5:521-527 (1994). Conservative site-specific recombination is 
distinguished from homologous recombination and transposition by a high degree 
of specificity for both partners. The strand exchange mechanism involves the 
cleavage and rejoining of spedfic DNA sequences in the absence of DNA 
10 synthesis (Landy, A. (1989) Ann. Rev. Biochem. 55:913-949). 

Subcloning vector: is a cloning vector comprising a circular or linear 
DNA molecule which includes an approimate replicon. In the iweswit invention, 
the subcloning vector (segment D in Figure 1) can also contain functional and/or 
regulatory elements that are desired to be incorporated into the final product to 
15 act upon or with the cloned DNA Insert (segmenty4 in Figure 1). The subcloning 

vector can also contain a Selectable marker (contained in segment C in Figure 1). 

Vector: is a DNA that provides a useful biological or biochemical 
property to an Insert Examples include plasmids, phages, and oAer DNA 
sequences which are able to rq)Kcate or be repUcated in vitro or in a host cell, or 
20 to convey a desired DNA segment to a desired location \njthin a host cell. A 

Vector can have one or more restriction endonuclease recognition sites at which 
the DNA sequences can be cut in a detenninable fashion widiout loss of an 
essential biological function of the vector, and into which a DNA fragment can 
be spliced in order to bring about its replication and cloning. Vectors can fiirther 
25 provide primer sites, e.g., for PCR, transcriptional and/or translational initiation 

and/or regulation sites, recombinational signals, replicons. Selectable maricers, 
etc. Cleariy, metlK)ds of inserting a desired DNA fiiagment vtluch do not require 
the use of homologous recombination or restriction eiizymes (such as, but not 
limited to, UDG cloning of PCR fragments (U.S. Patent No. 5334,575, entirely 
30 incorporated herein by reference), T:A cloning, and the like) can also be appKed 

to clone a fragment of DNA into a cloning vector to be used according to the 
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present invention. The cloning vector can further contain a Selectable maiicer 
suitable for use in the identification of cells transformed with the cloning vector. 

Vector Donor: is one of the two parental DNA molecules of the present 
invention which carries the DNA segments encoding the DNA vector \\1iich is to 
5 become part of the desired Product The Vector Donor comprises a subcloning 

vector D (or it can be called the cloning vector if the Insert Donor does not 
already contain a cloning vector) and a segment C flanked by recombination sites 
(see Figure 1). Segments C and/or D can contain elements that contribute to 
selection for the desired Product daughter molecule, as described above for 
10 selection schemes. The recombination signals can be the same or different, and 

can be acted iqx>n by the same or different recombinases. In addition, the Vector 
Donor can be linear or circular. 

Description 

One general scheme for an in vitro or in vivo method of the invention is 
15 shown in Figure 1 , \^ere the Insert Donor and the Vector Donor can be either 

circular or linear DNA, but is shown as circular. Vector I> is exchanged for the 
original cloning vector A, It is desirable to select for the daughter vector 
containing elementsyi and2> and against other moIecules,including one or more 
Cointegrate(s). The square and circle are different sets of recombination sites 
20 (e.g., lox sites or att sites). Segments or D can contain at least one Selection 

Marker, e3q>ression signals, origins of replication, or specialized functions for 
detecting, selecting, e?q>ressing, mapping or sequencing DNA, where D is used 
in this example. 

Examples of desired DNA segments that can be part of Eldnent or D 
25 include, but are not limited to, PGR products, large DNA segments, genomic 

clones or firagments, cDNA clones, functional elements, e/c, and genes or partial 
genes, \^ch encode usefiil nucleic acids or proteins. Moreover, the 
recombinational cloning of the present invention can be used to make ex vivo and 
in vivo gene transfer vehicles for protein expression and/or gene therapy. 
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In Figure 1, the scheme provides the desired Product as containing 
vectors D and A, as follows. The Insert Donor (containing A and B) is first 
recombined at the square recombination sites by recombination proteins, with the 
Vector Donor (containing C and 2>), to form a Co-integrate having each of A-D- 
5 C-B. Next, recombmation occurs at the circle recombination sites to form 

Product DNA (4 and 2>) and Byproduct DNA (C and B). However, if desired, 
two or more different Co-integrates can be formed to generate two or more 
Products. 

In one embodiment of the present in vitro or in vivo recombinational 
10 cloning method, a method for selecting at least one desired Product DNA is 

provided. This can be understood by consideration of the map of plasmid 
pEZC726 dqjicted in Figure 2. The two exemplary recombination sites are oT/P 
and /oxi>. On one segment defined by these sites is a kanamycin resistance gene 
whose promoter has been replaced by the te/OP operator/promoter from 
15 transposon TwlO. In the absence of tet repressor protein. E coHKNA polymerase 

transcribes the kanamycin resistance gene fix)m the tetOP. If tet repressor is 
present, it binds to tetOP and blocks transcription of the kanamycin resistance 
gene. The other segment of pEZC726 has the tet repressor gene expressed by a 
constitutive promoter. Thus cells transformed by pEZC726 are resistant to 
20 diloramphenicol, because of the chloramphenicol acetyl transferase gene on the 

same segment as tetR, but are sensitive to kanamycin. The recombination 
reactions result in separation of the gene fixjm the regulated kanamycin 

resistance gene. This separation results in kanamyciniBsistance in cells recdving 
the desired recombination Product. 

different sets of plasmids were constructed to demonstrate the 
in vitro method. One set, for use with Ore lecombinase only (cloning vector 602 
and subcloning vector 629 (Figure 3)) contained hxP and loxP 511 sites. A 
second set, for use with Cre and integrase (clonmg vector 705 and subcloning 
vector 726 (Figure 2)) contained /oxP and a» sites. The efficiency of production 
30 of ^edesared daughter plasmid vvas about 60 fold higher using both enzymes 

than usmg Ore alone. Nineteen of twenty four colonies fit>m Ae Cre-only 
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leaction contained the desired product, while thirty eight of thirty eight colonies 
from the integrase plus Cre reaction contained the desired product plasmid. 

Other Selection Schemes A variety of selection schemes can be used that 
are known in the art as they can suit a particular purpose for which the 
5 recombinational cloning is carried out Depending upon individual preferences 

and needs, a number of different types of selection schemes can be used in the 
recombinational cloning method of the present invention. The skilled artisan can 
take advantage of the availability of the many DNA segments or methods for 
making them and the difTermt methods of selection that are routinely used in the 

10 art Such DNA segments include but are not limited to those which encodes an 

activity such as, but not limited to, production of RNA, peptide, or protein, or 
providing a binding site for such RNA, peptide, or protein. Examples of DNA 
moleciiles viscd in devising a selection scheme are given above, under the 
definition of "selection scheme" 

1 S Additional examples include but are not limited to: 

(i) Generation of new primer sites for PGR (e.g., juxta^sition of two 
DNA sequences that were not previously juxtaposed); 

(ii) Inclusion of a DNA sequence acted upon by a restriction 
endonuclease or other DNA modifying enzyme, chemical, 

20 ribozyme, etc.; ^ 

(iii) Inclusion of a DNA sequence recognized by a DNA binding 
protein, RNA, DNA, chemical, etc.) for use as an afiBnity 
tag for selecting for or excluding from a population) (I>avis, M/c/. 
Acids Res. 24:702-706 (1996); J. 8027-8034 (1995)); 

25 (W) //I viTro selection ofRNAligands for the ribosomalL22 protein 

associated with Epstein-Barr virus-ex{Hressed RNA by using 
landomized and cDNA-dmved RNA libraries; 
(vi) The positioning of functional elements v^ose activity requires a 
specific orimtation or juxtaposition (e.^., (a) a recombination site 

30 which reacts pooriy in trans, but vAen placed in in the 

presence of the apjaropi iate proteins, resphs in reg^inKmgitinn th?t 
desbx>ys certain populations of molecule (e.g,, reconstitution of 
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a promoter sequence that allows in vitro RNA synthesis). The 
RNA can be used directly, or can be reverse transcribed to obtain 
the desired DNA construct; 

(vii) Selection of the desired product by size (e.g., fiactionation) or 
5 other physical property of the molecule(s); and 

(viii) Inclusion ofa DNA sequence required for a specific modification 
(e.g., methylation) that allows its identification. 

After fonnation of the Product and Byproduct in the method of the present 
invention, the selection step can be carried out either in vitro or in vivo depending 
10 upon the particular selection scheme which has been optionally devised in the 

particular recombinational cloning procedure. 

For example, an in vitro method of selection can be devised for the Insert 
Donor and Vector Donor DNA molecules. Such scheme can involve engineering 
a rare restriction site in the starting circular vectors in such a way that after tfie 
15 recombination events the rare cutting sites end up in the Byproduct Hence, when 

the restriction enzyme vAAch binds and cuts at the rare restriction site is added to 
the reaction mixture in vitro, all of the DNA molecules carrying the rare cutting 
site, i.e., the starting DNA molecules, the Comtegrate, and the Byproduct, wiU be 
cut and rendered nonreplicable in the intended host cell. Few example, cutting 
20 sites in segments B and C (see Figure 1) can be used ^ select agamst all 

molecules except the Product Alternatively, only a cutting site in C is needed if 
one is able to select for segment D, eg. , by a drag resistance gme not found on B, 
Similarly, an in vitro selection method can be devised when dealing with 
linear DNA molecules, DNA sequences complementary to a PGR primer 
25 sequence can be so engmeered that they are transferred, through the 

recombinational cloning metiiod, only to die Product molecule. AfW the 
reactions are completed, the ^propriate primes are added to the reaction solution 
and the sample is subjected to PGR. Hence, aU or part of the Product molecule 
is amplified. 

Crtho: mvh^ selection schemes can be used with a variety of £ co^^^ 
lines. Qneistopmarqjressorgeneononesegment oftiiesubcloningplasmid, 
and a drug marker controlled by that repressor on the other segment of the same 
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plasmid. Anotho- is to put a killer gene on segment C of the subcloning plasmid 
(Figure 1). Of course a way must exist for growing such a plasmid, i.e., there 
must exist circumstances under which the killer gene will not kill. There are a 
number of these genes known which require particular strains ofK coli. One 
5 such scheme is to use the restriction enzyme Dpnl^ wiiich will not cleave imless 

its recognition sequence GATC is methylated. Many iK>pular conmion K coli 
strains methylate GATC sequences, but there are mutants in which cloned Dpnl 
can be e7q>ress6d without harm. 

Of course analogous selection schema can be devised for other host 
10 organisms. For example, the tet repressor/operator of TnlO has been ad^ted to 

control gene e3q)ression in eukaryotes (Gossot, M., and Bujard, H., Proc, Natl 
Acad Sci, USA «P:5547-5551 (1992)). Thus the same control of drug resistance 
by the tet repressor exemplified herein can be applied to select for Product in 
eukaryotic cells. 

15 Recombination Proteins 

In the present invention, the exchange of DNA segments is achieved by 
the use of recombination proteins, including recombinases and associated 
co-factors and proteins. Various recombination proteins aij described in the art 
Examples of sudi recombinases include: 

20 Cre: A protein fiom bacteriophage PI (Abrraiski and Hoess, J. Biol 

Ghent 25P(3): 1509-15 14 (1984)) catalyzes the exchange (i.e., causes 
recombination) between 34 bp DNA sequences called /oxP (locus of crossover) 
sites (See Hoess et al, Nucl Acids Res, I4(5):22S7 (1986)). Cre is available 
commercially ^ovagen. Catalog No. 69247-1). Recombination mediated !^ Cre 

25 is fieely rev^ible. From thmnodynamic consid^n^onsl it is not smprising that 

Cre-mediated integration (recombination between two molecides to form one 
molecule) is much less efficient than Cre-mediated excision (recombination 
between two loxP sites in the same molecule to form two daughter molecules). 
Cre works in simple buffers with eidier magnesium or spenni dine as a co&ctor, 

30 as is well known in flie art The DNA substrates can be either linear or 
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supercoiled. A number of miitant loxP sites have been described (Hoess et al , 
supra). One of these, loxP 51 h recombines with another loxP 511 site, but will 
not recombine with a loxP site. 

Integrase: A protein from bacteriophage lambda that mediates the 
integration of the lambda genome into the E. coli chromosome. The 
bacteriophage X Int recombinational proteins promote irreversible recombination 
between its substrate att sites as part of the formation or induction of a lysogenic 
state. Reversibility of the recombination reactions results from two independent 
pathways for integrative and excisive recombination. Each pathway uses a 
unique, but overlappmg, set of the 15 protein binding sites that comprise att site 
DNAs. Cooperative and competitive interactions involving foiur proteins (Int, 
Xis, IHF and FIS) determine the direction of recombination. 

Integrative recombination involves the Int and IHF proteins and sites attP 
(240 bp) and attB (25 bp). Recombination results in the formation of two new 
sites: ottL and attR. Excisive recombination requires Int. IHF, and Xis, and sites 
attL and attR to generate attP and attB. Under certam conditions, FIS stimulates 
excisive recombination. In addition to these normal reactions, it should be 
appreciated that attP and attB, when placed on the same molecule, can promote 
excisive recombination to generate two excision products, one with attL and one 
with attR. Similarly, intermolecular recombination J>etween molecules 
containing attL and ottR^ in the presence of Int, IHF and Xis, can result m 
integrative recombination and the generation attP and attB. He»ice, by flanking 
DNA segments with appropriate combmations of engineered ^ sites, in the 
presence of the appropriate recombination proteins, one can dkect excisive or 
integrative recombination, as reverse reactions of each other. 

Each of the att sites cont^bs a 1 5 bp core sequence; individual sequence 
elements of functional significance lie within, outside, and across the boundaries 
of this common core (Landy, A., Amt Rev. Biochem. 58:9U (1989)). Efficient 
recombination between the various att sites requires that the sequence of the 
central common region be identical between the recombining partners, however, 
the exact sequence is now found to be modifiable. Consequently, derivatives of 
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the att site with changes within the core are now discovered to recombine as least 
as efficiently as the native core sequences, 

hitegrase acts to recombine the attP site on bacteriophage lambda (about 
240 bp) with the attB site on the K coli genome (about 25 bp) (Weisberg, Rj\. 
5 and Landy, A. in Lambda II, p. 21 1 (1983), Cold Spring Harbor Laboratory)), to 

produce the integrated lambda genome flanked by attL (about 1 00 bp) and attR 
(about 1 60 bp) sites. In the absence of Xis (see below), this reaction is essentially 
irreversible. The integration reaction mediated by integrase and THF works 
in vitro, with simple buffer containing spermidine. Integrase can be obtained as 

10 desCTibedby Nash, HJl^ Methods of Enzymology I OOaiO-216 (1983). IHF can 

be obtained as described by Filutowicz, M., et al.. Gene 7^7:149-150 (1994). 

In the presence of the X protein Xis (excise) integrase cataljrzes the 
reaction of attR and attL to form attP and cOtB^ le., it promotes the reverse of the 
reaction described above. This reaction can also be applied in the present 

15 invention. 

Other Recombination Systems. Numerous recombination systems firom 
various organisms can also be used, based on the teaching and guidance provided 
herein. See, e,g,, Hoess et al.. Nucleic Acids Research /^(6):2287 (1986); 
Abremski et al, J, Biol Chem,261{\y39\ (1986); Campbell, J. Bacteriol 

20 I 74(23):7495 (1992); Qian et al., J, Biol Chem. 267(11^:7794 (1992); Araki 

et a/., J. Mol Biol 22J(1):25 (1992)). Many of these belong to the integrase 
fmiily of recombinases (Argoseffl/. £MBOJ; 5:433-440 (1986))^ Perhaps the 
best studied of these are the Integrase/a/t system from bacteriophage X (Landy, 
A. (1993) Current Opimons in Genetics and Devel 5:699-707), the CrdloxP 

25 system from bacteriophage PI (Hoess and Abrmiski (1990) In Nucleic Acids and 

Molecular Biology^ voL 4. Eds.: Eckstein and Lilley, Berlin-Heidelberg: 
Spriiiger-Verlag; pp/9()-109) , and the FLP/FRT system from the Saocl^ 
c^evisiae 2 ii circle plasmid (Broach et al Cell 2P:227-234 (1 982)). 

Membm of a second faxoSSy of site-specific recombinases, the resolvase 

30 femily (e.g., yd, Tn3 resolvase, Hin, (Jin, and Cin) are also known. Members of 

this highly related &mily of recombinases are typically constrained to 
intramolecular reactions {e.g.y inversions and excisions) and can require host- 
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encoded factors. Mutants have been isolated that relieve some of the 
requirements for host factors (Macser and Kahnmann (1991) Mol. Gen. Genet. 
230: 1 70- 1 76), as well as some of the constraints of intramolecular recombination. 
Other site-specific lecombinases similar to X Int and similar to PI Cre can 
5 be substituted for bit and Cre. Such recombinases are known. In many cases the 

purification of such other recombinases has been described in the art. Incases 
when they are not known, cell extracts can be used or the enzymes can be 
partially purified using procedures described for Cre and InL 

While Cre and Int are described in detail for reasons of example, many 
10 related recombinase systems exist and their application to the described invention 

is also provided according to the present invention. The integrase family of site^ 
specific recombinases can be used to provide alternative recombination proteins 
and recombination sites for the present invention, as site-specific recombination 
proteins encoded by bacteriophage lambda, phi 80, P22, P2, 1 86, P4 and PI . This 
15 group of proteins exhibits an unexpectedly large diversity of sequences. Despite 

this diversity, all of the recombinases can be aligned in their C-teiminal halves. 

A 40-residue region near the C temunus is particularly well conserved in 
all the proteins and is homologous to a region near the C terminus of the yeast 2 
mu plasmid Flp protein. Three positions are perfectly conserved within this 
20 family: histidine, arginine and tyrosine are found at laspective alignment 

positions 396, 399 and 433 within the well-conserved C-terminal region. These 
residues contribute to the active site of this family of recombinases, and suggest 
that tyrosine-433 forms a transient covalent linkage to DNA during strand 
cleavage and rejoining. See. e.g.. Argos, P. etal,EMBOJ. 5:433-40 (1986). 
25 Alternatively, IS231 and other Bacillus thuringiensis transposable 

elements could be used as recombination proteins and recombination sites. 
Bacillus thuringiensis is an entomopathogenic bacterium whose toxicity is due 
to the presmce in the sporangia of delta-endotoxin crystals active against 
agricultural pests and vectors of human and animal diseases. Most of the genes 
30 codmg for these toxin proteins are plasmid-bome and are generally structurally 

associatedwithinsertionsequences(IS231,lS232,IS240,ISBTl andISBT2)and 
transposons (Tn4430 and Tn5401). Several of these mobUe elements have been 
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shown to be active and participate in the crystal gene mobility, thereby 
contributing to the variation of bacterial toxicity. 

Structural analysis of the iso-IS23 1 elanents indicates that they are related 
to ISl 151 from Clostridium perfringens and distantly related to IS4 and IS186 
5 from Escherichia coli. Like the other IS4 family members, they contain a 

conserved transposase-integrasc motif found in other IS families and retroviruses. 

Moreover, functional data gatiiered from IS231A in Escherichia coli 
indicate a non-replicative mode of transposition, with a preference for specific 
targets. Similar results wctc also obtained in Bacillus subtilis and B. 

10 thuringiensis. See, e.g., Mahillon, J. etaL, Genetica P3:13-26 (1994); Campbell, 

J, Bacterion49S'lA99 (1992). 

The amount of recombinase \^ch is added to drive the reccmibination 
reaction can be determined by using known assays. Specifically, titration assay 
is used to determine the appropriate amount of a purified recombinase enzyme, 

15 or the appropriate amount of an extract 

Engineered Recombination Siies. The above recombinases and 
corresponding recombinase sites are suitable for use in recombination cloning 
according to the present invention. HowevCT, wild-type recombination sites 
contain sequences that reduce tibe efiBcicncy or specificity of recombination 

20 reactions as applied in methods oftfae present invention. For example, multiple 

stop codons in attB, attR, attP, attL and loxP recombination sites occur in 
multiple reading frames on both strands, so recombination efBciCTicies are 
reducted, e.g., where the coding sequwce xnust cross the recombination sites, 
(only one reading firame is available on eadi strand of loxP and attB sites) or 

25 impossible (in attP, attR or attL). 

Accordingly, the presCTt invention also provides ' engineered 
recombination sites that overcome diese probl^ns. For example, att sites can be 
eng^ieered to have one or muMpIe miitations to enhance specificity 
of the lecombiimtion reaction and fbe properties of Product DNAs (e.^. , attl , at£2, 

30 and att3 sites); to decrease reverse reaction (eg:, rmioving PI and HI fix>m attB). 

The testing of diese mutants determines \diidi mutants yield sufBcient 
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recombinational activity to be suitable for recombination subcloning according 
to the present invention. 

Mutations can therefore be introduced into recombination sites for 
enhancing site specific recombination. Such mutations include, but are not 
limited to: recombmation sites without translation stop codons that allow fiision 
proteins to be encoded; recombination sites recognized by the same proteins but 
differing in base sequence such that they react largely or exclusively with theu^ 
homologous partners allow multiple reactions to be contemplated. Which 
particular reactions take place can be specified by which particular partners are 
present in the reaction mixture. For example, a tripartite protein fusion could be 
accomplished with parental plasmids containing recombination sites attRl and 
attR2; attLl and attL3; and/or attR3 and attL2, 

Th^ are well known procedures for introducing specific mutations into 
nucleic acid sequences. A number of these are described in Ausubel, F.M. et al,, 
15 Current Protocols in Molecular Biology^ AWley Interscience, New Yoric (1989- 

1996). Mutations can be designed into oligonucleotides, which can be used to 
modify existing cloned sequences, or in amplification reactions. Random 
mutagenesis can also be employed if ^propriate selection methods are available 
to isolate the desired mutant DNA or RNA The presence of the desired 
20 mutations can be confirmed by sequencing the nucleic a«id by well known 

methods. 

The following non-limiting methods can be used to mgineer a core region 
of a given recombination site to provide mutated sites suitable for use in the 
present invention: 

1 • By recombination of two parental DNA sequences by site-specific (e.g. 
attL and attR to give attB) or other (e.g. homologous) 
recombination mechanisms. The DNA parental DNA segments 
containing one or more base alterations resulting in the final core 
sequence; 

2. By mutation or mutagenesis (site-specific, PGR, random, spontaneous, 
etc) directly of tfie desired core sequence; 



30 



BNSCXXiDt <WO_^964a724A1_lA> 



wo 96/40724 PCTAJS96/10082 

-28- 

3- By mutagenesis (site-specific, PGR, random, spontanteous, etc) of 
parental DNA sequences, which are recombined to generate a 
desired core sequence; and 
4. By reverse transcription of an RNA encoding Uie desired core 
5 sequence. 

The functionality of the mutant recombination sites can be demonstrated 
in ways that depend on the particular characteristic that is desired. For example, 
the lack of translation stop codons in a recombination site can be demonstrated 
by expressing the appropriate fusion proteins. Specificity of recombination 
10 betwem homologous partners can be demonstrated by introducing the appropriate 

molecules into in vitro reactions, and assaying for recombiriation products as 
desoibed herein or known in fee art Other desired mutations in recombination 
sites might include the presence or absence of restriction sites, translation or 
transcription start signals, protein binding sites, and otfa^ known fimctionalities 
15 of nucleic acid base sequences. Genetic selection schemes for particular 

fimctional attributes in the recombination sites can be used according to known 
method steps. For example, the modification of sites to provide (fix)m a pair of 
sites that do not interact) partners that do interact could be achieved by requiring 
deletion, via recombination between the sites, of a DNA sequence encoding a 

I 20 toxic substance. Similarly, selection for sites that r^Bove translation stop 

i 

I sequences, the presence or absence of protein binding sites, etc., can be easily 

i devised by those skilled in tiie art 

Accordingly, the present invention provides a imcleic acid molecule, 
comprising at least one DNA segment having at least two dng^ieered 
25 recombination sites flanking a Selectable marker and/or a desired DNA segment, 

vdieiein at least one of said recombination sites comprises a core region having 
at least one engineered ihutation that enhances recombination in vitro in the 
formation of a Gointegrate DNA or a Product DNA. 

The nucldc acid molecule can have at least one mutation that confers at 
30 least one enhancCTient of said recombination, said ^ihancement selected fiom tfie 

group consisting of substantially (jl) &voring excisive integration; QS) favoring 
excisive recombination; (ii) relieving the requirement for host factors; (tii) 
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increasing the efficiency of said Cointegrate DNA or Product DNA formation; 
and (iv) increasing the specificity of said Cointegrate DNA or Product DNA 
formation. 

The nucleic acid molecule preferably comprises at least one 
5 recombination site derived from attB, attP, attL or attR. More preferably the att 

site is selected from attl , att2, or att3, as described herein. 

In a preferred embodiment, the core region comprises a DNA sequence 
selected from the group consisting of: 

(a) RKYCWGCTTTYKTRTACNAASTSGB(in-att)(SEQIDNO:l); 
10 (b) AGCCWGCTTTYKTRTACNAACTSGB (m-attB) (SEQ ID 

NO:2); 

(c) GTTCAGCTTTCKTRTACNAACTSGB (m-attR) (SEQ ID 

NO:3); 

(d) AGCCWGCTTTCKTRTACNAAGTSGB (m-attL) (SEQ ID 

15 NO:4); 

(e) GTTCAGC'nTYKTRTACNAAGTSGB(m-attPl) (SEQ ID 

NO:5); 

or a corresponding or complementary DNA or RNA sequence, wherein R=A or 

G; K=G or T/U; Y=C or TAJ; W=A or T/U; N=A or C or G or TAJ; S=Cor G; 
20 and B=C or G or TAJ. as presented in 37 CJ.R. §I.823( which is entirely 

incorporated heiein by refoence; wfaodn the core region does not contain a stop 

codon in one or more reading frames. 

The core region also preferably comprises a DNA sequence selected from 

the group consisting of: 
25 (a) AGCCTGCrnTTTGTACAAACTTGT(attBl)(SEQIDNO:6); 

(b) AGCCTGCTTTCTTGTACAAACTTGT(attB2)(SEQIDNO:7); 

(c) ACCCAGCl l ICl lGTACAAACTrGT (attBS) (SEQ ID NO:8); 

(d) GTrCAGCTTTTTTGTACAAACTTGT(attRl)(SEQIDNO:9); 

(e) GTTCAGC 111 CITGTACAAACTTGT (attR2) (SEQIDNOrlO); 
30 (f) GTTCAGCl 11 C 1" 1 GTAGAAAGTTGG (attR3) (SEQ ID 

NO:ll); 
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(g) AGCCTGCTTTTTTGTACAAAGTTGG (attLl) (SEQ ID 
NO:12); 

(h) AGCCTGCTTTCTTGTACAAAGTTGG (attL2) (SEQ ID 
NO:13); 

5 (i) ACCCAGCTTTCTTGTACAAAGTTGG (attLB) (SEQ ID 

NO:14); 

G) GTTCAGCTTTTTTGTACAAAGlTGGCattPl) (SEQ ID NO: 1 5); 
(k) GTTCAGCTTTCTTGTACAAAGTTGG (attP2^3) (SEQ ID 
NO:16); or a corresponding or complementary DNA or RNA sequence. 

10 The present invention thus also provides a method for making a nucleic 

acid molecule, comprising providing a nucleic acid molecule having at least one 
engineered recombination site comprising at least one DNA sequence having at 
least 80-99% homology (or any range or value therein) to at least one of SEQ ID 
NOS:l-16, or any suitable recombination site, or which hybridizes under 

15 stringent conditions thereto, as known in the art. 

Clearly, there are various types and permutations of such welHcnown 
in vitro and in vivo selection methods, each of which are not described herein for 
the sake of brevity. However, such variations and permutations are contemplated 
and considered to be the different embodiments of the present invention. 

20 It is important to note that as a result of the prefen^ embodiment being 

m vf/ro recombiriation reactioris, non-biological molecules such as PGR produ^^ 
can be manipulated via the present recombinational cloning method. In one 
example^ it is {K>ssible to clone linear molecules into circular vectors. 

There are a number of educations for &e present invention. These uses 

25 include, but are not limited to, changing vectors, exposing promoters with genes, 

constructing genes for fusion proteins, changing copy numb^, dianging 
replicons, cloning into phages, and cloning, e.g.y PGR products (wiA zn attB site 
at one end and a /oxP site at the odier end), genoiiiic DNAs, aiid cDNAs. 
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The following examples are intended to further illustrate certain preferred 
embodiments of the invention and are not intended to be limiting in nature. 



Examples 

The present recombinational cloning method accomplishes the exchange 
5 of nucleic acid segments to render something useful to the tiser, such as a change 

of cloning vectors. These segments must be flanked on both sides by 
recombination signals that are in the proper orientation with respect to one 
another. In the examples below the two parental nucleic acid molecules (e.^., 
plasmids) are called the Insert Donor and the Vector Donor, the Insert Donor 
10 contains a segment that will become joined to a new vector contributed by the 

Vector Donor. The recombination intermediate(s) that contain(s) both starting 
molecules is called the Comtegrate(s). The second recombination event produces 
two daughter molecules, called the Product (the desired new clone) and the 
Byproduct. 

15 Buffers 



Various known bufifers can be used in the reactions of the present 
invention. For restriction enzymes, it is advisable to use the bufifers 
recommended by the manufacturer. Alternative buffers can be readily found in 
the literature or can be devised by those of ordinary skill in the art 

Exaff^les 1-3. One exemplary buffer for lambda integrase is comprised 
of 50 mM Tris-HCl, at pH 7.5-7.8, 70 mM KCl, 5 mM spermidine, 0.5 mM 
EDTA, and 0.25 mg/ml bovine serum albumin, and optionally, 10% glycerol. 

One preferred buffer for PI Cre recombinase is comprised of 50 mM 
Tris-HCl at pH 7.5, 33 mM NaCl, 5 mM spermidine, and 0,5 mg/ml bovine 
^ 25 serum albumin. 

The buffer for other site-specific recombinases which arc similar to 
lambda Int and PI Cre are either known in the art or can be determined 
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empirically by the skilled artisans, particiilarly in light of the above-described 
buffers. 

Exanq>le 1: Recombinational Cloning Using Cre and Cre & Int 

Two pairs of plasmids were constructed to do the in vitro recombinationa] 
5 cloning method in two different ways. One pair, pEZC705 and pEZC726 

(Figure 2A), was constructed with loxP and att sites, to be used with Cre and 
X integrase. The other pair, pEZC602 and pEZC629 (Figure 3 A), contained the 
/oxP (wild type) site for Cre, and a second mutant lox site, loxP 511^ which 
differs from loxP in one base (out of 34 total). The minimnnr^ requirement for 
10 recombinational cloning of the present invention is two recombination sites in 

each plasmid, in general X and F, and X and T. Recombinational cloning takes 
place if either or both types of site can recombine to form a Cointegrate (e g. X 
and X'\ and if either or both (but necessarily a site different from the type 
forming the Cointegrate) can recombine to excise the Product and Byproduct 
15 plasmids from the Cointegrate {e.g, T and F). It is im}x>rtant that the 

recombination sites on the same plasmid do not recombine. It was found that the 
^ prescait recombinational cloning could be done with Cre alone. 

#- 

Cre-Onty 

Two plasmids were constructed to demonstrate this conception (see 
Figure 3A). pEZC629^ras the Vector Donor plasmid. It contained a constitutive 
drug marker (chloramphenicol resistance), an origin of replication, loxP and 
loxP 511 sites, a conditional drug marker Ocanamycin resistance v^liose 
expression is controlled by the operator/promote: of the tetracycline resistance 
opCTon of transposon T/ilO), and a constitutively expressed gene for the tet 
repressor protein, tetR. K coli cells containing pEZC629 were resistant to 
chloramphCTicol at 30 fig/ml, but sensitive to kanamycin at 100 |ig^. pEZC602 
was the Insert Donor plasmid, ^ch contained a different drug marker 
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(ampicillin resistance), an origin, and loxP and loxP 511 sites flanking a multiple 
cloning site. 

This experiment was comprised of two parts as follows: 
Parti: About 75 ng each of pEZC602 and pEZC629 were mixed in a 
total volume of 30 ^l of Cre buffer (50 mM Tris-HCI pH 7.5, 33 mM NaCl, 
5 mM spermidine-HCl, 500 ^g/ml bovine serum albxmiin). Two 10 ^1 aliquots 
were transferred to new tubes. One tube received 0.5 \x\ of Cre protein (approx. 
4 units per fil; partially purified according to Abremski and Hoess, J. Biol Chem, 
259:1509 (1984)). Both tubes were incubated at 37*C for 30 minutes, then 70"C 
for 10 minutes. Aliquots of each reaction were diluted and transformed into 
DH5a. Following expression, aliquots were plated on 30 ^ig/ml 
chloramphenicol; 100 ng/tol ampiciliin plus 200 jig/ml methicillin; or 100 pg/ml 
kanamycin. Results: See Table 1. The reaction without Ctc gave 1.11x10** 
ampiciliin resistant colonies (fiom the Insert Donor plasmid pEZC602); 7.8x10* 
chloramphenicol resistant colonies (from the Vector Donor plasmid pEZC629); 
and 140 kanamycin resistant colonies (backgroimd). The reaction with added Cre 
gave 7.5x10* ampiciliin resistant colonies (from the Insert Donor plasmid 
pEZC602); 6.1x10* chloramphenicol resistant colonies (from the Vector Donor 
plasmid pEZC629); and 760 kanamycin resistant colonies (mixture of 
background colonies and colonies from the recombination^d cloning Product 
plasmid). Analysis: Because the number of colonies on the kanamycin plates 
was much higher in the presence of Cre, many or most of them were predicted to 
contain the desired Product plasmid. 



25 





Table 1 


^izyme 


Ampiciliin 


Chloramphenicol 


Kanamycin 


Efficiency 


None 


1.1x10* 


^ 7.8x10* 


140 


140/7.8x10*= 0,02% 


Cre 


73x10* 


6.1x10* 


760 


760/6.1x10*= 0.12% 



30 



Part II: Twenty fb\ir colonies from the Cre" kanamycin plates woe 
picked and inoculated into medium containing 1 00 ^ig/ml kanamycin. Minipreps 
were done, and the miniprep DNAs, uncut or cut with Sinai or ///ndm, woe 
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electrophoresed. Results: 19 of the 24 minipreps showed supercoiled plasmid 
of the size predicted for the Product plasmid. All 1 9 showed the predicted Smal 
and HindSH restriction fiagments. Analysis: The Cre only scheme was 
demonstrated. Specifically, it was detmnined to have yielded about 70% (19 of 
5 24) Product clones. The efficiency was about 0. 1 % (760 kanamycin resistant 

clones resulted from 6. 1 x 1 0^ chloramphenicol resistant colonies). 



Cre Plus Integrase 

The plasmids used to demonstrate Uns mediod are exactly analogous to 
those used above, except Aat pEZC726, the Vector Donor plaanid, contained an 

10 attP site in place of loxP 511, and pEZC705, the Lasert Donor plasmid, contained 

an attB site in place of loxP 511 (Figure 2A). 

This experiment was comprised of three parts as follows: 
Part I: About 500 ng of pEZC705 (the Insert Donor plasmid) was cut 
with Scal^ which linearized the plasmid within the ampicillin resistance gene. 

15 (This was done because the k integrase reaction has been historically done with 

the attB plasmid in a linear state (H. Nash, personal conmiunication). However, 
it was found later that the integrase reaction proceeds well with both plasmids 
supercoiled.) Then, the linear plasmid was etfaanol precipitated and dissolved in 
20 »il of X integrase buffer (50 mM Tris-HCl, about pH 7.8, 70 mM KCl, 5 mM 

20 spermidine-HCl, 0.5 mM EDTA, 250 fig^ bovine serum albumin). Also, about 

500 ng of the Vector Donor plasmid pEZC726 was etfaanol jrocipitated and 
dissolved in 20 jil X integrase buflfer. Just before use, \ integrase (2 [il, 
393 ^g/ml) was thawed and diluted by adding 18 jil cold A integrase buffer. 
One nl IHF integration lK>st fector, 2.4 mg^ml, an accessory protein) was diluted 

15 into 150 111 cold A. integrase buffer. AUquots (2 pS) of each DNA were mixed 

with A. integrase 1niffer,widi or without 1 jil each A integrase and IHF, in a total 
of 10 nL The mixture was incubated at 25**C for 45 minutes, then at 70^C for 
10 minutes. Half of each reaction was applied to an agarose gel. Results: ^nih& 
presence of mtegrase and IHF, about 5% of the total DNA was converted to a 

\0 linear Cointegrate form. Analysis: Activity of integrase and IHF was confirmed. 
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Partll: Three microliters ofeach reaction (i.e., with or withomintegrase 
and IHF) were diluted into 27 ^1 of Cre buffer (above), then each reaction was 
split into two 10 ^1 aliquots (four altogether). To two of these reactions, 0.5 fil 
of Cre protein (above) were added, and all reactions were incubated at 3TC for 
30 minutes, then at IQTC for 10 minutes. TE buffer (90 jil; TE: 10 mM Tris-HCl, 
pH 7.5, 1 mM EDTA) was added to each reaction, and 1 ^il each was transfonned 
into £ coli DH5a. The transformation mixtures were plated on 100 |ag/ml 
ampicUlin plus 200 fig/ml methicillin; 30 jig/ml chloramphenicol; or 100 jig/ml 
kanamycin. Results: &e Table 2. 



Table 2 


Enzyme 


Ampicillin 


Chloramphenicol 


Kanamycin 


Efficiency 


None 


990 


20000 


4 


4/2x10* = 0.02% 


Cre only 


280 


3640 


0 


0 


Integrase* 
only 


1040 


27000 


9 


9/ 2.7x10* = 0.03% 


Integrase* 
+ Cre 


110 


1110 


76 


76/1.1x10^ = 6.9% 

==: 



♦ Integrase reactions also contained IHF, 

Analysis: The Cre protem impaired transformationT When adjusted for 
this effect, the number of kanamycin resistant colonies, compared to the control 
reactions, increased more than 100 fold when both Cre and Integrase were used. 
This suggests a specificity of greater than 99%. 

Part III: 38 colonies were picked from the Integrase plus Cre plates, 
miniprep DNAs were maide and cut with HmWl to give diagnostic mapping 
information. Result: AU 38 had precisely the expected fragment sizes. 
Analysis: The Cre plus i mtegrase method was observed to have much higher 
specificity than Cre-alone. Conclusion: The Cre plus k integrase method was 
demonstrated. EflBciency and specificity were much higher than for Cre only. 
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Exan^le 2: Using in vitro Recombinational Cloning to Subclone the 
Chloramphenicol Acetyl Transferase Gene into a Vector for Expression in 
Eukaryotic Cells (Figure 4 A) 

An Insert Donor plasmid, pEZC843, was constructed^ comprising the 
chloramphenicol acetyl transferase gene of K coli^ cloned between loxP and attB 
sites such that the loxP site was positioned at the 5'-end of the gene (Figure 4B). 
A Vector Donor plasmid, pEZClOOS, was constracted, which contained the 
cytomegalovirus eukaryotic promoter apposed to a loxP site (Figure 4C). One 
microliter aliquots of each sup^coiled plasmid (about 50 ng crude miniprep 
DNA) were combined in a ten microliter reaction containing equal parts of 
lambda integrase buffer (50 mM Tris-HCl, pH 7.8, 70 mM KCl, 5 mM 
spermidine, 0^ mM EDTA, 0^5 mg/mi bovine serum albumin) and Cre 
recombinase buffer (50 mM Tris-HCl, pH 7.5, 33 mM NaCl, 5 mM spermidine, 
0^ mg/ml bovine serum albumin), two units of Cre recombinase, 16 ng 
integration host factor, and 32 ng lambda integrase. After incubation at 30*C for 
30 minutes and 75**C for 10 minutes, one microliter was transformed into 
competent E. co/i strain DH5a (Life Technologies, Inc.). Aliquots of 
transformations were spread on agar plates containing 200 ^g/ml kanamycin and 
incubated at 37"C overnight An otherwise identical control reaction contained 
Ae Vector Donor plasmid only. The plate receiving 10% dF the control reaction 
transformation gave one colony; the plate receiving 1 0% of the recombinational 
cloning reaction gave 144 colonies. These numbers suggested that greater than 
99% of the recombinational cloning colonies contained the desired product 
plasmid. Miniprep DNA made from six recombinational cloning colonies gave 
the predicted size plasmid (5026 base pairs), CMVProd. Restriction digestion 
with JVicoI gave the fragments predicted for flie chloramphenicol acetyl transfi^ase 
cloned dowristream of the CMV promoter for all six plasrnids. 
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Subcloned DNA Segments Flanked by attB Sites Without Stop 



Parti: Background 



The above examples are suitable for transcriptional fusions, in which 
transcription crosses recombination sites. However, both attR and loxP sites 
contain multiple stop codons on both strands, so translational fusions can be 
difiBcult, w4iere the coding sequence must cross the recombination sites, (only one 
reading fiame is available on each strand of loxP sites) or impossible (in attR or 
attL). 

A principal reason for subcloning is to fuse protein domains. For 
example, fiision of the glutathione S-transferase (GST) domain to a protein of 
interest allows the fusion protem to be purified by affinity chromatography on 
glutathione agarose (Pharmacia, Inc., 1995 catalog). If the protein of interest is 
fused to runs of consecutive histidines (for example His6), the fusion protein can 
be purified by affinity chromatography on chelating resins containing metal ions 
(Qiagen, Inc.). It is often desirable to compare amino terminal and carboxy 
tenninal fusions for activity, solubiUty, stability, and the like. 

The attB sites of the bacteriophage A, integration sysjpm were examined 
as an alternative to loxP sites, because they are small (25 bp) and have some 
sequence flexibifity (Nash, H.A. et al. Proc. Natl Acad. Set USA 5^:4049-4053 
(1987). It was not previously suggested that multiple mutations to remove all 
stop codes would result in useful recombination sites for recombinational 
subcloning. 

Using standard nomenclature for site specific recombination in lambda 
bacteriophage (Wdsber, in Lambda lU, Hendrix, et al., eds.. Cold Spring Harbor 
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the nucleotide regions that 
coU host cell are represented 

PI — H1--P2--X--H2--C-0-C--H' — P»l--P'2 — P'3-- 

— B-O-B'-- 
Int, IHF II Xis, Int, IHF 

PI - -HI - -P2 - -X — H2 - -C-O-B • - - 

--B-0-C — H' — P»l--P«2 — 

where: O represents the 1 5 bp core DNA sequence found in both the phage and 
K coli genomes; B and B' rqsresent approximately 5 bases adjacent to the core 
in the£ co/i genome; and PI, HI, P2, X, H2, C, ff, Fl, P^, and P3 repr^ 
known DNA sequences encoding protein bmding domains in the bacteriophage 
X genome. 

The reaction is reversible in the presence of the protein Xis (exdsionase); 
recombination between attL and attR precisely excise the A. genome ftom its 
integrated stale, regenaating the cnx:ular X g^ome contaming attP and the linear 
£ co// genome containing attB. 

Part JI: Construction and Testing ofPlasmids Coniaining Mutant att Site^ 

Mutant attL and attR sites were constructed. Importantly, Landy et al 
{Ann, Rev, Biochent 58:913 (1989)) observed that deletion of Ae PI and HI 
domains of attP facilitated the excision reaction and elimiiiated the i^^ 
reaction, thereby making the excision reaction irreversible. Therefore, as 
mutations were introduced in attR, the PI and HI domains were also deleted. 
attR sites in the present example lack the P I and H I regions and have the Ndel 
site removed (base 27630 changed from C to G), and contain sequences 
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Laboratory, Cold Spring Harbor, NY (1989)), 
participate in the recombination reaction in an 
as follows: 



attP — 



attB 



attR 



atti. 
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corresponding to bacteriophage X coordinates 27619-27738 (GenBank release 
92.0, bg:LAMCG, "Complete Sequence of Bacteriophage Lambda"). 

The sequence of attB produced by recombination of wild type attL and 
attR sites is: 



B O B' 

AGCCT GCTTrXTi'ATA CTAa C TTGA 3* (SEQ. ID 

NO:31) 

TCGGA CGAAA AAAmTGAT T GAACT 5> 

The sftop codons are italicized and underiined. Note that sequences of attL, attR, 
and attP can be derived from the attB sequence and the boundaries of 
bacteriophage X contained within attL and attR (coordinates 27619 to 27818), 
When mutant attRl and attLl sites were recombined the sequence attBl 
was produced (mutations in bold, large font): 



attBwt I 5 • 

3 • 



B O B' 

attBl: 5- AGCCT gCTTTTTtGtAcAaA CTTgT 3 ^^^Q' ™ ^0:6) 

3* TCGGA cGAAJSlAAaCaTGTtT GAAcA 5* 

0 

Note that the four stop codons are gone. 

When an additional mutation was introduced in the attRl and attLl 
sequences (bold), attR2 and attL2 sites resulted. Recombmation of attR2 and 
attL2 produced the attB2 site: 



B o B • 

^^^2= 5' GCTTtCtTCTACAAA ^' (SEQ. ID NO:7) 

^' cgaaaGaacatcttt sr 



The recombination activities of the above attL and attR sites wero assayed 
as follows. The attB site of plasmid pEZC705 (Figure 2B) was replaced with 
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attLwt, attLl, or attL2. The attP site of plasmid pEZC726 (Figure 2C) was 
replaced with attRwt (lacking regions PI and HI), attRl, or attR2. Thus, the 
resulting plasmids could recombine via their loxP sites, mediated by Cre, and via 
their attR and attL sites, mediated by hit, Xis, and IHF, Pairs of plasmids were 
mixed and reacted with Cre, Int, Xis, and IHF, transfomied into K coli competent 
cells, and plated on agar containing kanamycin. The results are presented in 
Table 3: 



Tables 



Vector donor att site 


Gene donor att site 


# of kanamycin resistant colonies^ 


attRwt (pEZCnOl) 


None 


1 (background) 




attLwt(pEZC1313) 


147 


tt 


attLl (pEZC1317) 


47 


n 


attL2(pEZC1321) 


0 


attRl (pEZC1305) 


None 


1 (background) 


ti 


attLwt(pEZC1313) 


4 


ft 


attLl (pEZC1317) 


128 


ti 


attL2(pEZC1321) 


0 


attR2(pEZC1309) 


None 


9 (background) 


tt 


attLwt(pEZC1313) 


0 


ft 


attL2(pEZC1317) 


0 


" 


attL2(pEZC1321) 


209 



(* 1% of each transformaticHi was spread on a kanamycin plate^ 



The above data show that whereas the wild type att and attl sites 
recombine to a small extent, the attl and att2 sites do not recombine detectably 
with each other. 

Part in. Recombination Avas demonstrated ^(^dien the core region of both 
attB sites flanking the DNA segment of interest did not contain stop cbdons. The 
physical state of the participating plasmids was discovered to influence 
recombination eflBciency. 

The £q>propiiate att sites were moved into pEZC705 and pEZC726 to 
make the plasmids pEZC1405 (Figure 5G) (attRl and attR2) and pEZC1502 
(Figure 5H) (attLl and attL2), ThedesiredDNAsegment in this experiment was 
a copy of the cUoramph^col resistance gene cloned between the two attL sites 
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of pEZC1502. Pairs of plasmids were recombined in vitro using Int, Xis, and 
IHF (no Cre because no loxP sites were present). The yield of desired kanamycin 
resistant colonies was detemiined when both parental plasmids were circular, or 
when one plasmid was circular and the other linear as presented in Table 4: 



Table 4 



Vector donor' 


Gene donor' 


Kanamycin resistant colonies^ 


Circular pEZC 1405 


None 


30 


Circular pEZC 1405 


Circular pEZC 1 502 


2680 


Linear pEZC1405 


None 


90 


Linear pEZCHOS 


Circular pEZC1502 


172000 


Circular pEZC 1 405 


Linear pEZC1502 


73000 



' DNAs were purified with Qiagen columns, concentrations determined by 
A260, and linearized with Xba I (pEZC1405) or AlwN I (pEZC1502X Each 
reaction contained 100 ng of the indicated DNA. All reactions (10 jil total) 
contained 3 pi of en2yme mix (Xis. Int, and IHF). After incubation (45 minutes 
at 25**, 10 minutes at 65*), one pi was used to transform E, coli DH5a cells. 
^Number of colonies expected if the entire transformation reaction (1 ml) had been 
plated. Either 100 pi or 1 pi of the transformations were actually plated. 

Anafysis: Recombinational cloning using mutant attR and attL sites was 
confirmed The desired DNA segmait is subcloned between attB sites that do not 
contain any stop codons in either strand. The enhanced yield of Product DNA 
(when one parent was linear) was unexpected because of earilfer obsCTvations that 
the excision reaction was more efficient when both participating molecules were 
supercoiled and proteins were limiting (Nunes-Duby et aL, Cell 50:779-788 
(1987). 



Example 4: Demonstration of Recombinational Cloning WUhosti Inverted 
Repeats 

Parti: Rationale 

The above Example 3 showed that plasmids containing inverted repeats 
of the appropriate recombination sites (for example, attLl and attL2 in plasmid 
pEZC1502) (Figure 5H) could recombine to give the desired DNA segment 
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flanked by attB sites without stop codons, also in inverted orientation. A concern 
was the in vivo and in vitro influence of the inverted repeats. For example, 
transcription of a desired DNA segment flanked by attB sites in inverted 
orientation could yield a single stranded RNA molecule that might form a hairpin 
structure, thereby inhibiting translation. 

Inverted orientation of similar recombination sites can be avoided by 
placing the sites in direct repeat arrangement att sites. If parental plasmids each 
have a wild type attL and wild type attR site, in direct repeat the Int, Xis, and IHF 
proteins will simply remove the DNA segment flanked by those sites in an 
intramolecular reaction. However, the mutant sites described in the above 
Example 3 suggested that it might be possible to inhibit the intramolecular 
reaction while allowing the intermolecular recombination to proceed as desired. 

Parill: Structure of Plasmids Without Inverted Rq>eats for Recombinational 
Cloning 

The attR2 sequence in plasmid pEZC1405 (Figure 5G) was replaced vsdth 
attL2, in the opposite orientation, to make pEZC1603 (Figure 6A). The attL2 
sequence of pEZC1502 (Figure 5H) was replaced with attR2, in the opposite 
orientation, to make pEZCl 706 (Figure 6B). Each of these plasmids contained 

■ m 

mutations in the core region that make intramolecular reactions between attl and 
att2 cores veiy inefficient (see Example 3, above), 

Plasmids pEZC 1 405, pEZC 1502, pEZC1603 and pEZC1706 w«e 
purified on Qiagen columns (Qiagcn, Inc.). Aliquots of plasmids pEZC1405 and 
pEZC1603 were Imearized wifli Xba I. Aliquots of plasmids pEZC1502 and 
pEZC1706 were linearized with AlwN L One hundred ng of plasmids were 
mixed in buffer (equal volumes of 50 mM Tris HCl pH 7.5, 25 mM Tris HCl pH 
8.0, 70 mM KQ, 5 mM spermidine, 0.5 mM EDTA, 250ng/ml BSA, 10% 
glycerol) containing Int (43.5 ng), Xis (43 ng) and IHF (8.1 ng) in a final volume 
oflO jil. Reactions were mcubated for 45 minutes at 25 •'C, 10 minutes at 65 
and 1 fil was transformed into £ coli DHSa. After expression, aliquots were 
spread on agar plates containing 200 ^g/ml kanamycin and incubated at 37*'C. 
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Results, expressed as the niunber of colonies per 1 \i\ of recombination 
reaction are presented in Table 5: 



Tables 



Vector Donor 


Gene Donor 


Colonies 


Predicted % product 


Circular 1405 


— 


100 


— 


Circular 1405 


Circular 1502 


3740 


3640/3740 = 97% 


Linear 1405 




- 90 




Linear 1405 


Circular 1502 


172,000 


171,910/172,000 = 99.9% 


Circular 1405 


Linear 1502 


73.000 


72,900/73,000 = 99.9% 










Circular 1603 




80 




Circular 1603 


Circular 1706 


410 


330/410-80% 


Linear 1603 




270 




Linear 1603 


Circular 1706 


7000 


6730/7000 = 96% 


Circular 1603 


Linear 1706 


10,800 


10,530/10,800 = 97% 



Analysis. In all configurations, i.e., circular or linear, the pEZC1405 x 
pEZC1502 pair (with att sites in inverted rq)eat configuration) was more efficient 
than pEZC1603 x pEZC1706 pair (with att sites mutatefl to avoid haiipin 
formation). The pEZC1603 x pEZC1706 pair gave higher backgrounds and 
lower efficiencies than the pEZC1405 x pEZC1502 pair. While less efficient, 
80% or more of the colonies trom the pEZC1603 x pEZC1706 reactions were 
expected to contain the desired plasmid product Making one partner linear 
stimulated the reactions in all cases. 

Part III: Confirmation of Product Plasmids' Structure 

Six colonies each from the linear pEZC1405 (Figure 5G) x circular 
pEZC1502 (Figure 5H), circular pEZC1405 x linear pEZC1502. linear 
pEZC1603 (Figure 6A) x circular pEZC1706 (Figure 6B), and circular 
pEZC1603 x linear pEZC1706 reactions were picked into rich medium and 
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miniprep DNAs were prepared. Diagnostic cuts with Ssp I gave the predicted 
restriction fragments for all 24 colonies. 

Analysis. Recombination reactions between plasmids with mutant attL 
and attR sites on the same molecules gave the desired plasmid products with a 
high degree of specificity. 

Example 5: Recombinational Cloning with a Toxic Gene 
Parti: Background 

Restriction enzyme Dpri I recognizes the sequence GATC and cuts that 
sequence only if the A is methylated by the dam methylase. Most commonly 
used E. coll strains are dam^. Expression of Dpn I in dam* strains of coli is 
lethal because the chromosome of the cell is chopped into many pieces. 
However, in dam' cells expression of Dpn I is innocuous because ihe 
chromosome is immune to Dpn I cutting. 

In the gmeral lecombin^onal cloning scheme, in \^iiich the vector donor 
contains two segments C and D separated by recombination sites, selection for 
the desired product depends upon selection for the presence of segment D, and 
the absence of segment C. In the original Example segmenjD contained a drug 
resistance gene (Km) that was negatively controlled by a repressor gene found on 
segment C When C was present, cells containing D were not resistant to 
kanamycin because the resistance gene was tumed off. 

The I>pn I gene is an example of a toxic gene that can replace the 
repressor g«ie of the above embodiment If segment C e^qiresses flie Dpn I gene 
product, transforming plasmid CD into a dam* host kills Ae cell. If segment D 
is transferred to a new plasmid, for example by recombinational cloning, thai 
selecting for the drug marker will be successful because the toxic gene is no 
longer present 



wo 96/40724 PCT/US96/10082 

-45- 

Part II: Construction of a Vector Donor Using Dpn I as a Toxic Gene 

The gene encoding Dpn I endonuclease was amplified by PCR using 
primers 5'CCA CCA CAA ACG CGT CCA TGG AAT TAC ACT TTA ATT 
TAG3* (SEQ. ID NO: 17) and 5'CCA CCA CAA GTC GAC GCA TGC CGA 
CAG CCT TCC AAA TGT3' (SEQ. ID NO: 1 8) and a plasmid containing the Dpn 
I gene (derived from plasmids obtained from Sanford A. Lacks, Brookhaven 
National Laboratory, Upton, New York; also available from American Type 
Culture Collection as ATCC 67494) as the template. 

Additional mutations were introduced into the B and B' regions of attL 
and attR, respectively, by ampli^^g existing attL and attR domains with primers 
containing the desired base changes. Recombination of the mutant attL3 (made 
with oligo Xisl 15) and attR3 (made with oligo Xisl 12) yielded attB3 with the 
following sequence (differences from attBl in bold): 

B o B' 

ACCCA GCTTTCTTGTACAAA GTGGT (SEO. id NO: 8) 

tGggT cgaaaGaacatgttt CaCca 

The attL3 sequence was cloned in place of attL2 of an easting Gene Donor 
plasmid to give the plasmid pEZC2901 (Figure 7A). The attR3 sequence was 
cloned in place of attR2 in an existing Vector Donor plasmid to give plasmid 
pEZC2913 (Figure 7B) Dpn I gene was cloned into plasmid pEZC291 3 to replace 
the tet repressor gene. The resulting Vector Donor plasmid was named 
pEZCBlOl (Figure 7C). When pEZCSlOl was transformed into the dam' stram 
SCSI 1 0 (Stratagene), hundreds of colonies resulted. When the same plasmid was 
transformed into the dam+ strain DH5a, only one colony was produced, even 
though the DH5a cells were about 20 fold more competent than the SCSI 10 
cells. When a related plasmid that did not contain the I>pn I gene was 
transformed into the same two cell lines, 28 colonies were produced from the 
SCSI 10 cells, \**ile 448 colonies resulted from the DH5a cells. This is evidence 



wo 96/40724 PCT/US96/10082 

-46- 

that the Dpn I gene is being expressed on plasmid pEZCSlOl (Figure 7C), and 
that it is kiUing the dam* DH5a cells but not the dam" SCSllO cells. 

Pan III: Demonstration of Recombinationat Cloning Using Dpn I Selection 

A pair of plasmids was used to d^onstrate recombinational cloning with 
selection for product dependent upon the toxic gene Dpn I. Plasmid pEZC3 1 0 1 
(Figure 7C) was linearized with Mlu I and reacted with circular plasmid 
pEZC2901 (Figure 7A). A second pair of plasmids using selection based on 
control of drug resistance by a repressor gene was used as a control: plasmid 
pEZC1802 (Figure 7D) was linearized with Xba I and reacted with circular 
plasmid pEZC1502 (Figure 5H). Eght microliter reactions containing the same 
buffer and protems Xis, Int, and IHF as in previous examples ware incubated for 
45 minutes at 25 ''C, thenlO minutes at 75**C, and 1 /d aliquots were transformed 
into DH5a O e., dam+) competent cells, as presented in Table 6. 



Table 6 



Reaction # 


Vector donor 


Basis of selection 


Gene donor 


Colonies 


1 


p£ZC3101/Mlu 


Dpn I toxicity 




3 . 


2 


p£ZC3101/Mhi 


I^m I toxicity 


Circular pEZC2901 


4000 


3 


pEZC1802/Xba 


Tctrqpressor 




0 


4 


pEZC]802OCba 


Tetrq>ressor 


Circular pEZC15Q2 


12100 



Mmiprep DNAs were prepared from four colonies from reaction #2, and 
cut with restriction enzyme Sspl. All gave the predicted fragments. 

Anafysis: Subcloning using selection witb a toxic gene was 
demonstrated. Plasmids of the predicted structure were produced, . 
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Example 6: Cloning of Genes with Uracil DN A Glycosylase and Subcloning 
of the Genes with Recombinational Cloning to Make Fusion Proteins 

Part I: Converting an Existing Expression Vector to a Vector Donor for 
Recombinational Cloning 

5 A cassette useful for converting existing vectors into functional Vector 

Donors was made as foUows. Plasmid pEZC3 1 01 (Figure 7C) was digested with 
Apa I and Kpn I, treated with T4 DNA polymerase and dlMTPs to render the ends 
blunt, fitttfaer digested with Sma I, Hpa I, and AlwN I to render the undesirable 
DNA fiagments small, and Ae 2.6 ld> cass^ containing the attRl - Cm"^ - Dpn I 
10 - attR-3 domains was gel purified. The concentration of the purified cassette was 

estimated to be about 75 ng DNA/^1. 

Plasmid PGEX-2TK: (Figure 8A) (Pharmacia) allows fiisions between the 
protein ^utathione S transferase and any second coding sequence that can be 
inserted in its multiple cloning site. pGEX-2TK: DNA was digested with Sma I 
15 and treated with alkaline phosphatase. About 75 ngofthe above purified DNA 

cassette was ligated with about 100 ng of the pGEX-2TK vector for 23 hours in 
a 5 pi ligation, ttien 1 was transfonned into competent BRL 3056 cells (a dam- 
derivative of DHIOB; dam- strams commercijilly available include DM1 from 
Life Technologies, Inc., and SCS 110 fix>m Stiatagene)r Aliquots of the 
20 transformation mixture were plated on LB agar containing 100 ng/ml ampicillin 

(resistance gene present on pGEX-2TK) and 30 jig/ml chloramphenicol 
(resistance gene present on the DNA cassette). Colonies were picked and 
miniprep DNAs were made. The orientation of the cassette in pGEX-2TK was 
determined by diagnostic cuts with EcoR L A plasmid with the desired 
25 orientation was nanied pEZC3501 (Figure 8B). 



BNSrOCID: <WO_96«>724A1_IA> . ;, .. 

iiiiiiiii -11 '-iT Ill I'liiiii - i --II I ■ - ■ •■ --i-iir-iit-tii ■ ■ I II ' ' I lif^iSiiiiiiili 



wo 96/40724 PCr/US96/l 0082 

-48- 

Part II: Cloning Reporter Genes Into an Recombinaiional Cloning Gene 
Donor Plasmid in Three Reading Frames 

Uracil DNA glycosylase (UDG) cloning is a method for cloning PGR 
amplification products into cloning vectors (U-S. patent No. 5^34,515, entirely 
incorporated herein by reference). Briefly, PGR amplification of the desired 
DNA segment is perfomied with primers that contain xiracil bases in place of 
thymidine bases in their 5* ends. When such PGR products are incubated with the 
enzyme UDG, the uracil bases are specifically removed. The loss of these bases 
weakens base pairing in the ends of the PGR product DNA, and when incubated 
at a suitable temperature (e.g., 37 ^'G), the ends of such products are largely single 
stranded. If such incubations are done in the presence of linear cloning vectors 
containing protmding 3* tails that are complementary to the 3' ends of the PGR 
products, base pairing eflBciently anneals the PGR products to the cloning vector. 
When the annealed product is introduced into E, coli cells by transformation, in 
vivo processes efiBciently convert it into a recombinant plasmid. 

UDG cloning vectors that enable cloning of any PGR product m all three 
reading firames were prepared from pEZG3201 (Figine 8K) as follows. Eight 
oligonucleotides were obtained from Life Technologies, Inc. (all written 5* - 3': 
rfl top (GGGG GAT TAG GAT ATC GCA ACG AGG GAA AAG GTG TAT 
TTT GAG GGT) (SEQ. ID NO:19), rfl bottom (GAG GTT TTG CX5T GGT 
TGG GAT ATG GTA ATC)(SEQ. ID NO:20), rC top (GGCGA GAT TAG 
GAT ATG GCA AGG AGG GAA AAG GTG TAT TTT GAG GGTXSEQ. ID 
NO:21), rf2 bottom (GAG GTT TTG GGT GGT TGG GAT ATG GTA 

ATG TKSEQ. ID NO:22), rf3 top (GGGGAA GAT TAG GAT ATG GCA ACG 
ACC GAA AAG GTG TAT ITT GAG GGT)(SEQ. ID NO:23), rO bottom 
(GAG GTT TTG GGT GGT TGG GAT ATG GTA ATG TT)(SEQ. ID NO:24), 
carboxy top (AGG GTT TAG GTG GAG)(SEQ. ID NO:25) and carboxy 
bottom (TGGA GTG GAG GTA AAG GGT TGG GAG TTA TrA)(SEQ. ID 
NO:26). The rfl, 2, and 3 top strands and the carboxy bottom strand were 
phosphorylated on their 5' ends with T4 polynucleotide kinase, and thai the 
complementary strands of each pair ware hybridized. Plasmid pEZC3201 
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(Figure 8K) was cut with Not I and Sal I, and aliquots of cut plasmid were mixed 
with the carboxy-oligo duplex (Sal I end) and either the rfl, rE2, or rfj duplexes 
(Not I ends) (10 ng cut plasmid (about 5 pmol) mixed with 250 pmol carboxy 
oligo duplex, split into three 20 pi volumes, added 5 jil (250 pmol) of rfl, rf2, or 
rB duplex and 2 fil = 2 units T4 DNA ligase to each reaction). After 90 minutes 
of ligation at room temperature, each reaction was applied to a preparative 
agarose gel and the 2. 1 kb vector bands were eluted and dissolved in 50 nl of TE. 

Part III: PCR of CAT and phoA Genes 



Primers were obtained from Life Technologies, Inc., to amplify the 
10 chloramphenicol acetyl transferase (CAT) gene from plasmid pACYCl 84, and 

phoA, the alkaline phosphatase gene from K coli. The primers had 12-base 5' 
extensions containing uracil bases, so that treatment of PCR products with uracil 
DNA glycosylase (UDG) would weaken base pairing at each end of the DNAs 
and allow the 3' strands to anneal with the protruding 3' ends of the rfl, 2, and 3 
15 vectors described above. The sequences of the primers (aU written 5' - 3*) 

were: CAT left, UAU UUU CAG GGU ATG GAG AAA AAA ATC ACT GGA 
TAT ACC (SEQ. ID NO:27); CAT right, UCC CAC UUA UUA CGC CCC 
GCC CTG CCA CTC ATC (SEQ. ID NO:28); phoA left. UAW UUU CAG GGU 
ATG CCT GTT CTGGAA AAC CGG (SEQ. ID NO:29); and phoA right, UCC 
CAC UUA UUA TTT CAG CCC CAG GGC GGC TTT C (SEQ. ID NO:30). 
The primers were then used for PCR reactions using known method steps {see. 
e.g.. U.S. patent No. 5334.515, entirely incorporated herein by reference), and 
the polymerase chain reaction amplification products obtained with these primers 
comprised the CAT or phoA genes with the initiating ATGs but without any 
25 transcriptional signals. In addition, the uracil-containmg sequences on the amino 

temiini encoded the cleavage site for TEV protease (Life Technologies, liicX and 

those on the carboxy terminal encoded consecutive TAA nonsense codons. 

Unpurified PCR products (about 30 ng) were mbced with the gel purified, 
linear rfl , rf2, or rC cloning vectors (about 50 ng) in a 1 0 ^ reaction containing 
IX REact 4 buffer (LTI) and 1 unit UDG (LTI). After 30 minutes at ST'C, 1 fil 



20 



30 
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aliquots of each reaction were transformed into competent E. coli DH5a cells 
(LIT) and plated on agar containing 50 fig/ml kanamycin. Colonies were picked 
and analysis of miniprep DNA showed that the CAT gene had been cloned in 
reading fiame 1 ^EZC3601)(Figure 8Q, reading fiame 2 (pEZC3609XFigure 
8D) and reading frame 3 (pEZC3617)(Figure 8E), and that the phoA gene had 
been cloned in reading fiame 1 (pEZC3606)a^igure 8F), reading frame 2 
(pEZC3613XFigure 8G) and reading frame 3 (pEZC3621)(Figure 8H). 

Part IV: SubdoningofCAT or phoAfrom UDG Cloning Vectors into a GST 
Fusion Vector 

Plasmids encoding fusions between GST and either CAT or phoA in all 
three reading frames were constructed by recombinational cloning as follows. 
Miniprep DNA of GST vector donor pEZC3501 (Figure 8B) (derived from 
Phamjacia plasmid pGEX-2TK as described above) was linearized with Cla I. 
About 5 ng of vector donor were mixed with about 10 ng each of the appropriate 
circiilar gene donor vectors containing CAT or phoA in 8 nl reactions containing 
buffer and recombination proteins Int,Xis, and IHF (above). Afta- incubation, 
1 fil of each reaction was transformed into K coli strain DH5a and plated on 
ampicillin, as presented in Table 7. 



Table 7 



20 



25 



DNA 


Colonies (10% of each transfoimation) 


Linear vector donor (pEZC3501/Cla) 


0 


Vector donor + CAT rfl 


110 


Vector donor + CAT rO 


71 : 


Vector donor + CAT rO 


148 


Vector donor + phoA rfl 


121 


Vector donor + phoA rf2 


128 


Vector donor + phoA rO 


31 
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PartV: Expression of Fusion Proteins 
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Two colonies from each transformation were picked into 2 ml of rich 
medium (CircleGrow, BiolOl Inc.) in 17 x lOO mm plastic tubes (Falcon 2059, 
Becton Dickinson) containing 1 00 pg/ml ampicillin and shaken vigorously for 
5 about 4 hours at 37 "^C, at time the cultures were visibly turbid. One ml of 

each culture was transferred to a new tube containing 1 0 ^1 of 10% (w/v) IPTG 
to induce expression of GST. After 2 hours additional incubation, all cultures had 
about the same turbidity; the A600 of one culture was 1 .5. Cells from 035 ml 
each culture were harvested and treated with sample buffer (containing SDS and 
10 P-mercaptoethanol) and aliquots equivalent to about 0.15 A600 units of cells 

were applied to a Novex 4-20% gradient polyacrylamide gel. Following 
electrophoresis the gel was stained with Coomassie blue. 

Results: Enhanced expression of single protein bands was seen for all 12 
cultures. The observed sizes of these proteins correlated well with the sizes 
15 predicted for GST being fused (through attB recombination sites without stop 

codons) to CAT or phoA in three reading frames: CATrfl = 269 ammo acids; 
CAT rfZ = 303 amino acids; CAT r£3 = 478 amino acids; phoA rfl = 282 amino 
acids; phoA rf2 = 280 amino acids; and phoA rf3 = 705 amino acids. 

Anafysis: Both CAT and phoA genes were subclone^ into a GST fiision 
20 vector in all three reading frames, and expression of the six fiision proteins was 

demonstrated. 

While the foregoing invention has been described in some detail for 
purposes of clarity and understandmg, it will be ^>predated by one skiUed in the 
ait from a reading of this disclosure that various changes in form and detail can 
25 be made without departmg from the tme scope of the invention and appended 

claims. All patents and publications cited herein are entirely incorporated herein 
by reference. 



BNSDCWD: <WO_S640724Al_tA> 



wo 96/40724 

What Is Claimed Is: 



-52- 



PCTAJS96/10082 



1 . A Vector Donor DNA molecule comprising a first DNA segment 
and a second DNA segment, said first or second DNA segment containing at least 
one Selectable marker, A^erein the first and second segments are sejjarated eitha- 
by, (i) in a circular Vector Donor, a first and a second recombination site, or 
00 in a linear Vector Donor, at least a first recombination site, Miiercin each pair 
of flanking recombination sites are engineered and do not recombine with each 
other. 

2. A Vector Donor DNA molecule according to claim l,\)^1ierein the 
Selectable marker is at least one DNA segment selected fiom the group consisting 
of: 

(i) a DNA segment that encodes a product that provides 
resistance against otherwise toxic compounds; 

GO a DNA segment that encodes a product that is otherwise 
lacking in the recipient cell; 

(iii) a DNA segment that encodes a product that suppresses the 
activity of a gene product; 

(iv) a DNA segment that encodes a prodfict that can be readily 
identified; 

(v) a DNA segment that encodes a product that is detrimental 
to cell survival and/or fimction; 

(vi) a DNA segment that inhibits the activity of any of the 
DNA segments of OHv) above ; 

(vii) a DNA segment that binds a prckluct that modifies a 
substrate; 

(viii) a DNA segment that provides for isolation of a desir^ 
molecule; and 

Ox) a DNA segment that encodes a specific nucleotide 
sequence w*ich can be otherwise non-fimctional; and 
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(x) a DNA segment that, when absent, directly or indirectly 
confers sensitivity to particular compounds. 

3. A Vector Donor DNA according to claim 2, wherein said 
Selectable marker is at least one selected firom the group consisting of an 
antibiotic resistance gene, a tRNA gene, an auxotrophic marker, a toxic gene, a 
phenotypic marker, an antisensc oligonucleotide; a restriction endonuclease; a 
restriction endonuclease cleavage site, an enzyme cleavage site, a protein binding 
site; and a sequence complementary PCR primer. 



4. A Vector Donor DNA molecule according to claim 1 , vdierein said 
Selectable marker comprises at least one inactive fiagment of a Selectable 
marker, wherein the inactive fragment is capable of reconstituting a functional 
Selectable marker when rccombmed across said first or second recombination site 
with a further DNA segment comprising another inactive fragment of the 
Selectable marker. 

5. An Insert Donor DNA molecule, comprising a desired DNA 
segment flanked by a first recombination site and a second recombination site, 
wherein the first and second recombination sites are engineered and do not 
recombine with each other. 

6. An Insert Donor DNA molecule according to claim 5, v^ierein 
said desii^ DNA segment codes for at least one selected from the group 
consisting of a clonmg site, a restriction site, a promoter, an operon, an origin of 
repKcation, a fimctionai DNA, an antisense RNA, a PCR fragment, a protein or 
a protein fragnient 

7. A kit compismg a container being compartmentalized to receive 
in close confinement therein at one compartment, wherein a first compartment 
contains a Vector Donor DNA molecule comprismg a first DNA segment and 
second DNA segment, said first or second DNA segment containing at least 



a 
one 
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Selectable maiker, wherein the first and second segments are flanked either by, 
(i) in a circxilar Vector Donor, a first and a second recombination site, or (ii) in 
a linear Vector Donor, a first recombination site, wherein each pair of flanking 
recombination sites are engmeered and do not recombine with each other. 

8. A kit according to claim 7, fiitther comprising a second 
compartment containing an Insat Donor DNA molecule comprising a desired 
DNA segment flanked by a first recombination site and a second recombination 
site, wherein the first and second recombination sites are engineered and do not 
recombine wi& each oiber. 

9. A kit according to claim 7, fiirther comprising an additional 
compartment containing at least one recombination protein capable of 
TBcombining a DNA segment comprising at least one of said recombination sites. 

10. A nucleic acid molecule, comprising at least one DNA seginent 
having at least two recombination sites flanking a Selectable marker or a desired 
DNA segment, \iiiCTein at least one of said recombinatiori sites comprises a core 
region having at least one engineered mutation that enhances recombination in 
vitro in the formation of a €:k>integiate DNA or a Product DNA. 

11. A nucleic add molecule according to claim 10, v^CTein said 
mutation confers at least one enhancement of- said recombination, said 
oihancement selected from flie group consisting of substantiaUy (i) favoring 
excisive recombination; (ii) fevoring integrative recombination; (iii) relieving the 
requirement for host fectors; (iv) inraeasing the efBdency of said Cointegrate 
DNA or Pioduct DNA fonnation;(v) increasing the spedficityr of said Cointegrate 
DNA or Product DNA foimatiom and contributes desirable attributes to the 
Product DNA. 
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12. A nucleic acid molecule according to claim 10, wherein said 
recombination site is derived from at least one recombination site selected from 
the group consisting of attB, attP, attL and attR. 

13. A nucleic acid molecule according to claim 12, wherein said att 
5 site is selected from the groups consisting of attl, att2 and att3. 

14. A nucleic acid according to claim 1 0, wherein said core region 
comprises a DNA sequence selected from the group consisting of: 

(a) RKYCWGCTTrYKTRTACNAASTSGB(m-att)(SEQIDNO:l); 

(b) AGCCWGCTTTYKTRTACNAACTSGB (m-attB) (SEQ ID 

10 NO:2); 

(c) GTTCAGCTTTCKTRTACNAACTSGB (m-attR) (SEQ ID 

NO:3); 

(d) AGCCWGCTTTCKTRTACNAAGTSGB (m-attL) (SEQ ID 

N0:4); 

(e) GTTCAGCnTYKTRTACNAAGTSGB(m-attPl) (SEQ ID 
NO:5); 

and a corresponding or complementary DNA or RNA sequence, wherein R=A or 
G; K=G or TAJ; Y=C or T/U; W=A or T/U; N=A or C or Q-or T/U; S=Cor G; 
and B=C or G or TAJ. 

^® 1 5. A nucleic acid according to claim 14, wherein said core region 

comprises a DNA sequence selected from the group consisting of: 

(a) AGCCTGCrmTTGTACAAACTTGT(attBl)(SEQIDN0:6); 

(b) AGCCTGCTrTCTTGTACAAACTTGT(attB2)(SEQIDNO:7); 

(c) ACCCAGCnTCTrGTACAAACTTGT(attB3)(SEQIDNO:8); 
25 (d) GTTCAGCTrnTTGTACAAACTTGT(attRl)(SEQIDNO-9); 

(e) GTrCAGCnTCTTGTACAAACITGT(attR2)(SEQIDNO:10); 

(f) GTTCAGCl 1 ICllGTACAAAGTTGG (attR3) (SEQ ID 
NO: 11); 
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(g) AGCCTGCTTTTTTGTACAAAGTTGG (attLl) (SEQ ID 
NO:12); 

(h) AGCCTGCTTTCTTGTACAAAGTTGG (attL2) (SEQ ID 
NO:13); 

5 (i) ACCCAGCTTTCTTGTACAAAGTTGG (attL3) (SEQ ID 

NO:14); 

(i) GTrCAGCrmTTGTACAAAGTTGG(attPl) (SEQIDNO:15); 
(k) GTTCAGCTTTCTTGTACAAAGTTGG (attP2,P3) (SEQ ID 

NO: 16); 

10 and a corresponding or complementary DNA or RNA sequence. 

16. A method for making a nucleic acid molecule, comprising 
providing a nucleic acid molecule having at least one engineered 

recombination site comprising at least one DNA sequence haying at least 90% 
homology to at least one of SEQ ID NOS: 1-1 6. 

17. A nucleic acid molecule provided by a method according to 
claim 16. 

18. A composition, comprising a nucleic acid molecule according to 
claim 10. 
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19. A Idt, comprising a container being compartmentalized to receive 
in close confinement therein at least one compartment, vdierein a: first 
compartment contains a composition according to claim 1 8, 

20. A kit according to clahn 19, fiirther comprising a second 
compartment having at least one recombination protein that recognizes said 
recombination site. 
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21. A kit comprising a container being compartmentalized to receive 
in close confinement therein at least one recombination protein in isolated fomi, 
useful for a method according to claim 22. 

22. A method of making a Cointegrate DNA molecule, comprising 
combining in vitro: ^ 

(i) an Insert Donor DNA molecule, comprising a desired 
DNA segment flanked by a first recombination site and a 
second recombination site, wherein the first and second 
recombination sites do not recombine with each other; 

(ii) a Vector Donor DNA molecule cbntaining a third 
recombination site and a fourth recombination site, 
wherein the third and fourth recombination sites do not 
recombine with each other; and 

(iii) at least one site specific recombination protein capable of 

recombining said first and third recombinational sites said 

second and fourth recombinational sites; 
thereby aUowing recombination to occur, so as to produce a Cointegrate DNA 
molecule comprising said first and third or 'said second and fourth recombination 
sites. 



^ ^ according to claim 22, wherein a Product DNA 

molecule is produced from said Cointegrate DNA by recombining at least one of 
(i) said first and third, or (ii) said second and fourth, recombination sites, said 
Product DNA comprising said desired DNA segment 

24. A method according to claim 23, wherein said method also 
25 produces a Byproduct DNA molecule. 

25. A method according to claim 23. fiirther comprising selecting for 
the Product DNA molecule. 
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26. A method as claimed in claim 22, wlierein the Vector Donor DNA 
molecule comprises a vector segment flanked by said third and the fourth 
recombination sites. 

27. A method as claimed in claim 22, wherein the Vector £)onor DNA 
molecule fiirther comprises (a) a toxic gene and (b) a Selectable marker, wherein 
the toxic gene and the Selectable marker are on different DNA segments, the 
DNA segments being separated either by (i) in a circular DNA molecule, two 
recombmation sites, or (ii) in a linear DNA molecule, one recombination site. 

28. A method as claimed in claim 22, wdierein the Vector Donor DNA 
molecule further comprises (a) a repression cassette and (b) a Selectable marker, 
repressed by the repressor of the repression cassette, and \n*erein the Selectable 
marker and the repression cassette are on different DNA segments, the DNA 
segments being separated either by. (i) in a circular DNA molecule, two 
recombination sites, or (ii) in a linear DNA molecule, one recombination site. 

29. A method as claimed in claim 22, \\«ierein at lea^ one of the Insert 
Donor DNA molecule and the Vector Donor DNA molecule is a circular DNA 
molecule. . 



30, A method as claimed in claim 22, whereiu at least one of the Insert 
Donor DNA molecule and the Vector Donor DNA molecule is a linear DNA 
20 molecule. 



31. A method as claimed in claim 22, wherein the seleding step is 
carried out in vd>-o or m vfvo. 

32. A method as claimed in claim 22, whoein said recombination 
protein comprises at least a first recombination protein and a second 

25 recombination protein, said second recombination protein being different fiom 

said first recombination protein. 
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33. A method as claimed in claim 22, wherein said recombination 
protein is Int. 



34. A method as claimed in claim 22, wherein the at least one 
recombination pxotein is selected from (i) Int and IHF and 00 Int, Xis. and IHF. 
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